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Prognostic and Diagnostic Markers for cell proliferative disorders of the breast tissues 

The present invention relates to prognostic and diagnostic markers for cell proliferative disor- 
ders of the breast tissues. The present invention therefore provides methods and nucleic acids 
for the analysis of biological samples for features associated with the development of breast 
cell proliferative disorders. 

Background of the invention 

. This patent application relates to diagnosis and prognosis of breast cancer. Today. involvement 
of axillary lymph nodes and tumour size are the most important prognostic factors in .breast 
cancer. Although the presence or absence of metastatic involvement in the axillary lymph 
nodes is the most powerful prognostic factor available for patients with primary breast cancer, 
it is only an indirect measure reflecting the tumours' tendency to spread. In approximately 
one-third of .women with breast cancer and negative lymph nodes the disease recurs, while 
about one-third of patients with positive lymph nodes are free of recurrence ten years after 

• . loco-regional therapy. These data highlight the need for more sensitive and specific prognos- 
tic indicators, ideally reflecting the presence or absence of tumour-specific alterations in the 
bloodstream that may eventually even after years lead to metastasis. It is now widely accepted 
that adjuvant systemic therapy substantially improves disease-free and overall survival in both 
pre- and postmenopausal women up to the age of 70 years with lymph node-negative or 
lymph node^positive breast cancer (Early Breast Cancer Trialists' Collaborative Group Tam- 
oxifen for early breast cancer: an overview of the randomised trials. Early Breast Cancer Tri- 
alists' Collaborative Group. Lancet, 351: 1451-1467, 1998.2, 3). It is also generally accepted 
that patients with poor prognostic features benefit the most from adjuvant therapy, whereas 
some patients with good prognostic features may be overrated (Goldhirsch et al.: Meeting 
highlights: International Consensus Panel on.the Treatment of Primary Breast Cancer. Sev- 
enth International Conference on Adjuvant Therapy of Primary Breast Cancer. J. Clin. Oncol., 
.19: 3817-3827, 2001.). Moreover many other factors have been investigated for their potential 
to predict disease outcome, but in general they have only limited predictive value. Recently, 
interesting prognostic parameters including gene-expression profiles, cell cycle regulating 





proteins and occult cytokeratin-positive metastatic cells in the bone marrow have been added 
to the list of prognostic factors, but their prognostic relevance needs to be further evaluated. 

Changes in the status of DNA methylation, -known as epigenetic alterations, are one of the 
most common molecular alterations in human neoplasia, including breast cancer 
(Widschwendter and Jones: DNA methylation and breast carcinogenesis. Oncogene, 21: 
5462-5482, 2002). Cytosine methylation occurs after DNA synthesis by enzymatic transfer of 
a methyl group from the methyl donor S-adenosylmethionine to the carbon-5 position of cyto- 
sine. Cytosines are methylated in the human genome mostly when located 5' to a guanosine. 
Regions with a high G:C content are so-called CpG islands. It has been increasingly recog- 
nized over the past four to five years that the CpG islands of a large number of genes, which 
are mostly unmethylated in nprmal tissue, are methylated to. varying degrees in human can- 
cers, thus representing tumor-specific alterations: The presence of abnormally high DNA con- 
centrations in the serum of patients with various malignant diseases was described several 
years ago. The discovery that cell-free DNA can be shed into the bloodstream has generated 
great interest. Numerous studies have demonstrated tumor-specific alterations in DNA recov- 
ered from plasma or serum of patients with various malignancies, a finding that has potential 
for molecular, diagnosis and prognosis. The nucleic acid markers described in plasma and se- . 
rum- include oncogene mutations, microsatellite alterations, gene rearrangements and epige- 
netic alterations, such as. aberrant promoter hypermethylation (Anker et al.: Detection of cir- 
culating tumour DNA in the blood (plasma/serum) of cancer patients. Cancer Metastasis Rev., 
18: 65-73, 1999). During recent years some studies have reported cell-free DNA in se- 
rum/plasma of breast cancer patients at diagnosis (for example: Silva et al.: Presence of tumor 
DNA in plasma of breast cancer patients: cHnicopathological correlations. Cancer Res., 59: 
3251-3256, 1999) and in some cases persistence after primary therapy (for example: Silva et 
al.: Persistence of tumor DNA in plasma of breast cancer patients after mastectomy. Ann. 
Surg. Oncol, P: 71-76, 2002). Nevertheless an increasing number of studies have reported the 
' presence of methylated DNA in serum/plasma of patients with various types of malignancies, 
including breast cancer, and the absence of methylated DNA in normal control patients (for 
example: Wong et al: Detection of aberrant pl6 methylation in the plasma and serum of liver 
cancer patients. Cancer Res., 59: 71-73, 1999). So far, only few studies have addressed the 
prognostic value of these epigenetic alterations in patients' bloodstream (Kawakami et al.: 
Hypermethylated APC DNA in plasma and prognosis of patients with esophageal adenocarci- 
noma. J. Natl. Cancer Inst., 92: 1805-1811, 2000; Lecomte et al.: Detection of free-circulating 





^.associated DNA in plasma of colored cancer patients and its association wift prog- 
nosis. Int. J. Cancer, 100: 542-548, 2002). 

„ will be appreciafcd by those skilled in the tnt that tore exists a continning need ,0 improve 
lids of early detection classification and treatment of breast cancers. In tins app^abon 
prognostic and diagnostic DNA meftylation-based markers for breast cancer are ftsclosed. 

5-meftylcy.osine positions eanno, bo identified by sequencing since 5-mefty.cy.osine ta : the 
ame blaring behavior as cytosine. Moreover, tbe epigenetie informal earned by - 
Tft^oL is completely lost dnring PCR amplification. Current* tbe most fiaquentiy 
21 for analysing DNA for 5-methylcytosine is based upon fte specie raachon o 
bisuffite wift cytosine .which,, npon subsequent hydrolysis, is converted ourac.1 

S corresponds * thymidine in its base pairing, behaviour: However, 5-methylcytosme 
21 unm Led under ftese conditions. Consequently, the original DNA ^converted - 

^ b byhridisation behaviour, can now be detected as the only remaining oy^eu^ 
U- molecular biological techniques, for example, by amplification and hyhnmsa n » 
sequencing. AU of these techniques are based on base pairing which - now b «*y~ 
p, ited. in term* of sensitivity, fte prior art is defined by a method winch — 
I be analysed in: an agarose matrix, thus preventing fte diffusron and rent— of fte 
DNA (elite only reacts wift singie-stranded DNA), and which replaces ******* 
^purification steps wift ft* dialysis. (O.ek A, Oswald J, Walter , A moftfied and ^ 
% meftod for bisu,phi.e based cytosfte mediation analysis. NucHc ^™ 
Dec 15-24(24):5064-6). Using mis meftod, it is possible to analyse mdmdual cnUs, winch 
TJ1 1 potential of fte meftod. However, currently only individual regions of a lengft 

of lible methylation even, is not possible. However, this meftod c»not 
■ Z small fragments from small sample quantities either. These are lost through fte matnx m 
spite ofthe diffusion protection. 

A, overview of fte further knowu methods of detecting 5-meftylcyiosine may be gaftered 
. ^n the following review article: Fraga and Esteller: DNA Mediation: A Profile of Meft- 
ods and AppUcations. Bibtechniques 33:632-649, Sept 2002. 



To date, barring few exceptions (e.g., Zeschnigk M, Lich C, Buiting K, Doerfler W, 
Horsthemke B. A single-tube PCR test for the diagnosis of Angelman and Prader-Willi syn- 
drome based on allelic mediation differences at the SNRPN locus. Eur J Hum Genet. 1997 
Mar-Apr;5(2):94-8) the bisulfite technique is only used in research. Always, however, short, 
specific fragments of a known gene are amplified subsequent to a bisulfite treatment and ei- 
ther completely sequenced (Olek A, Walter J. The pre-implantation ontogeny of the H19 
methylation imprint. Nat Genet. 1997 Nov;17(3):275-6) or individual cytosine positions are 
detected by a primer extension reaction (Gonzalgo ML, Jones PA. Rapid quantitation of 
methylation differences at specific sites using methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE). Nucleic Acids Res. 1997 Jun 1 5;25(12):2529-3 1 , WO 95/00669) or 
by enzymatic digestion (Xiong Z, Laird PW. COBRA: a sensitive and quantitative DNA 
methylation assay. Nucleic Acids Res. 1997 Jun 15;25(12):2532-4). In addition, detection by 
hybridisation has also been described (Olek etal., WO 99/28498). 

Further publications dealing with the use of the bisulfite technique for methylation detection 
in individual genes are: Grigg G, Clark S ; Sequencing 5-methylcytosine residues in genomic 
DNA. Bioessays. 1994 Jun;16(6):431-6, 431; Zeschnigk M, Schmitz B, Dittrich B, Buiting K, 
Horsthemke B, Doerfler W. Imprinted segments in the human genome: different DNA meth- 
ylation patterns in the Prader-Willi/Angelman syndrome region as determined by the genomic 
sequencing method. Hum Mol Genet. 1997 Mar;6(3):3 87-95; Feil R, Charlton J, Bird AP, 
Walter J, Reik W, Methylation analysis on individual chromosomes: improved protocol for 
bisulphite genomic sequencing. Nucleic Acids Res. 1994 Feb 25;22(4):695-6; Martin V, 
Ribieras S, Song-Wang X, Rio MC, Dante R. Genomic sequencing indicates a correlation 
between DNA hypomethylation in the 5' region of the P S2 gene and its expression in human 
breast cancer cell lines. Gene. 1995 May 19;157(l-2):261-4; WO 97746705, WO 95/15373, 
and WO 97/45560. 

An overview of the Prior Art in oligomer array manufacturing can be gathered from a special 
edition of Nature Genetics (Nature Genetics Supplement, Volume 21, January 1999), pub- 
lished in January 1999, and from the literature cited therein. 

Fluorescently labelled probes are often used for the scanning of immobiUsed DNA arrays. 
The simple attachment of Cy3 and Cy5 dyes to the 5'-OH of the specific probe are particularly 
suitable for fluorescence labels.. The detection of the fluorescence of the hybridised probes 



may be carried out, for example via a confocal microscope. Cy3 and Cy5 dyes, besides many 
others, are commercially available. 



Matrix Assisted Laser Desorption Ionisation Mass Spectrometry (MALDI-TOF) is a very 
efficient development for the analysis of biomolecules (Karas M, Hillenkamp F. Laser de- 
sorption ionisation of proteins with molecular masses exceeding 10,000 daltons. Anal Chem. 
1988 Oct 15;60(20):2299-301). An analyze is embedded in a light-absorbing matrix. The ma- 
trix is evaporated by a short laser pulse thus transporting the analyte molecule into the vapour 
phase in an unfragmented manner. The analyte is ionised by collisions with matrix molecules. 
An applied voltage accelerates the ions into a field-free flight tube. Due to their different 
masses, the ions are accelerated at different rates. Smaller ions reach the detector sooner than 
bigger ones. 

MALDI-TOF spectrometry is excellently suited to the analysis of peptides and proteins. The 
analysis of nucleic acids is somewhat more difficult (Gut I G, Beck S. DNA and Matrix As- 
sisted Laser Desorption Ionisation Mass Spectrometry. Current . Innovations and Future 
Trends. 1995, 1; 147-57). The sensitivity to nucleic acids is approximately 100 times worse 
than to peptides and decreases disproportionally with increasing fragment size. For nucleic 
acids having a multiply negatively charged backbone, the ionisation process via the matrix is 
considerably less efficient. In MALDI-TOF spectrometry, the selection of the matrix plays an 
eminently important role. For the desorption of peptides, several very efficient matrixes have 
been found which produce a very fine crystallisation. There are now several responsive ma- 
trixes for DNA, however, the difference in sensitivity has not been reduced. The difference in 
sensitivity can be reduced by . chemically modifying the DNA in such a manner that it be- 
comes more similar to a peptide. Phosphorothioate nucleic acids in which the usual phos- 
phates of the backbone are substituted with thiophosphates can be converted into a charge- 
neutral DNA using simple alkylation chemistry (Gut IG, Beck S. A procedure for selective 
DNA alkylation and detection by mass . spectrometry. Nucleic Acids Res. 1995 Apr 
25;23(8):1367-73). The coupling of a charge tag to this modified DNA results in an increase 
in sensitivity to the same, level as that found for peptides. A further advantage of charge tag- 
ging is the increased stability of the analysis against impurities, which make the detection of 
unmodified substrates considerably more difficult. 



Genomic. DNA is obtained from DNA of cell, tissue or other test samples using standard 
methods. This standard methodology is found in references such as Fritsch and Maniatis eds., 
Molecular Cloning: A Laboratory Manual, 1989. 

Description 

the present invention provides methods and nucleic acids for the analysis of biological sam- 
ples for features associated with the development of breast cell proliferative disorders. The 
invention is characterised in that the nucleic acid of at least one member of the group of genes 
according to Table 1 is/are contacted with a reagent or series of reagents capable of distin- 
guishing between methylated and non methylated CpG dinucleotides within the genomic se- 
quence of interest. The present invention makes available a method for ascertaining genetic 
and/or epigenetic parameters of genomic DNA. The method is for use for the determining the 
prognosis of breast cell proliferative disorders. The invention presents improvements over the 
state of the art in that by means of the methods and compounds described herein a person 
skilled in the art may carry out a sensitive and specific detection assay of cellular matter com- 
• prising cancerous breast tissue. This is particularly useful as it allows the analysis of samples 
of body fluids which may contain only a minimal amount of cell proliferative disorder cellular 
matter, and enables the detection of said cells and the identification of the organ from which 
they originated (in this case breast). To date there are no known clinically utilisable means for 
the detection of breast cancer using genetic methylation markers to analyse bodily fluid.sam- 
ples, such as blood, lymphatic fluids, nipple aspirate and plasma. The generated information 
is useful in the selection of a treatment of the patient. If a positive prognosis is determined a 
further treatment might be redundant, while in a case of a poor prognosis a stronger treatment 
might be necessary. 

Furthermore, the method enables the analysis of cytosine methylations and single nucleotide 
polymorphisms. 

The genes that form the basis of the present invention are preferably to be used to form a 
"gene panel", i.e. a collectibn comprising the particular genetic sequences of the present in- 
vention and/or their respective informative methylation sites. The formation of gene panels 
allows for a quick and specific analysis of specific aspects of breast cancer. The gene panel(s) 
as described and employed in this invention can be used with surprisingly high efficiency for 



the diagnosis, treatment and monitoring of and the analysis of a predisposition to breast ceU 
proliferative disorders. 

to addition, the use of multiple CpG sites from a diverse array of genes allowsfor a relative* 
high degree of sensitivity and specifioity in comparison to single gene diagnosho and detec- 
tion tools. Of the genes known ,o be speeifieally methylated in breast eanoer, me particular 
combination of me genes according to me invention provides for a particularly sensitive and 
specific means for the identification of cell proliferative disorders of breast tissues. 

The object of the invention is most preferably achieved by means of the analysis of the meth- 
yl ation patterns of one or a combination of genes taken from the group taken from the group 
ESR1 APC HSD174B4, HIC1 and RASSF1A (see, for example, Table 1) and/or their regu- 
latory'regions. The invention is character^ in that the nucleic acid of one or a combination 
of genes taken from the group ESR1, APC, HSD174B4, HIC1 and RASSF1A are contacted 
with a reagent or series of reagent capable of distinguishing between methylated and non 
methylated CpG dmucleotides within the genomic sequence of interest. 

' The object of me invention can also be achieved by the analysis of the CpG methylation of 
' one or a pluratity of any subset of the group of genes ESR1, APC, HSD174B4, HIC1 and 
RASSF1 A, in particular the following subsets are preferred: 

.. RASSF1A and APC, 
. RASSF1A, and 
. APC 

The present invention makes available a method for ascertaining genetic and/or epigenetio 
parameters of genomic DNA The method is for use in (he improved diagnosis, treatment and 
monitoring of breast cell proliferative disorders. 

The disclosed invention further provides a method for detennining me phenotype of a subject 
with a breast cell proliferative disorder comprising 

a) obtaining a biological sample containing genomic DNA from said subject, 

b) analysing the methylation partem of one or more target nucleic acids comprising one or a 
combination of the genes taken from me group consisting of ESR1, APC, HSD.74B4, HIC1 



and RASSF1A and/or their regulatory regions by contacting at least one of said target nuclerc 
acids in the biological sample obtained torn said subject with at least one reagent, or senes of, 
agents that distinguishes between methylated and non-methylated CpG dinucleotides, and 
c) determining the pheno W e of the individual by comparison to two known phenotypes, a 
firs, phenotype characterised by hypermethylation of the target nucleic acid and poor progno- 
sis as relative to a second phenotype characterised by hypomethylation of me analysed target 
nucleic acid and better prognosis 

The DNA may be obtained from any form of biological sample including but not limited to 
cell lines, histological slides, biopsies, tissue embedded in paraffin, breast tissues, blood, 
piasma, lymphatic fluid, lymphatic tissue, duct cells, ductal lavage fluid, nipple asptrauon 
fluid and combinations thereof. Genomic DMA must men be isolated from the sample.usmg 
any means standard in toe art. The isolated DNA is treated with at least one reagent, or senes 
of reagents that distinguishes between methylated and non-methylated CpG dinucleotides. 
This may be carried out by any means standard in the art including the use of restriction en- 
donucleases. However, it is preferably carried out with bisulfite (sulfite, disunite) and subse- 
quent alkaline hydrolysis which results in a conversion of non-methylated cytosine nucleo- 
bases to uracil or to another base which is dissimilar to cytosine in terms of base pamng be- 
haviour If bisulfite solution is used for the reaction, then an addition takes place at the non- 
methylated cyiosine bases. Moreover, a denaturafing reagent or solvent as well as a radrcal 
interceptor must be present. A subsequent alkaline hydrolysis then gives rise to the conversron 
of non-methylated cytosine nncleobases to uracil. The converted DNA is then useti for tire 
detection of methylated cytosines. The methylation status of one ormore of the genes ESR1, 
AFC HSD174B4, HIC1 and RASSF1 Aand/or their regulatory regions is then analysed. Tins 
analysis may be carried out by any means standard in the art including toe above descnbed 
techniques. In the final step of the method the metoylation pattern of toe DNA obtained from 
the subject is compared to that of two known phenotypes. The first phenotype is character**, 
by hypennethylation or methylation of the target nucleic acid turd poor prognosis as relative 
to n second phenotype characterised by hypomethylation or no methylation of the analysed 
target nucleic acid and bettor prognosis. It is particularly preferred that the genes AFC 
and/RASSFIA are analysed; By determining which of the two phenotypes the subject belongs 
to it is possible to determine a suitable treatotent to her breast cell proliferative disorder. 




The m ethod according to the invention may be used for the analysis of a wide variety of cell 
proliferative disorders of the breast tissues including, but not limited to, ductal carcmoma ft. 
stn, lobular carcinoma, colloid carcinoma, tubular carcinoma, medullary carcinoma, mem- 
plastic carcinoma, intraductal carcinoma in situ, lobular carcinoma ft, situ arid paprllary carer- 
aoma in situ. ^ 

Furthermore, the method enables the analysis of cytosine mediations .and single nucleotide 
polymorphisms within said genes. 

The object of the invention is achieved by means of tire analysis of the methylation patterns of 
one or more of the genes ESR1, APC, HS D1 74B4, HIC1 and RASSF1A and/or tireir regula- 
tory.regions. In a particularly preferred embodiment the sequences of said genes compnse 
SEQ ID NOs: 1 to 5 and sequences complementary thereto. 

The object of the invention may also be achieved by analysing the methylation patterns of one 
or more genes taken from the following subsets- of said aforementioned group of genes. In one 
enrolment the object of the invention is achieved by analysis of tire methylation patterns of 
the genes RASSF1A and APC and wherein it is tourer preferred that the sequence of sard 
genes comprise SEQ ID NOs: 5 and 3, respectively. In a further embodiment tire object of the 
invention is achieved by analysis of the metirylation patterns of tine gene RASSF1 A and/or rts 
regulatory sequences, and wherein it is further preferred that the sequence of said gene com- 
prises SEQ ID NO: 5. The object of the invention may also be achieved by analysis of the 
^methylation pattern, of tire gene APC and/or its regulatory sequences, and wherein it is tourer 
I^B preferred that the sequence of said gene comprises SEQ ID NO: 3. 

' In a preferred embodiment said method is.achieved by contacting said nucleic acid sequences 

wherein said reagent or series of reagents, distinguishes between mediated arrd non mettryl- 
atedCpGdinucleotides within the target nucleic acid . . • 

In a preferred embodiment, the method comprises the following steps: 
In the first step of the method the genomic DNA sample must be isolated from sources sueh 
. as cetis or ceUular components which contain DNA sources of DNA comprising, for exam- 
■ pie cell lines, histological slides, biopsies, tissue embedded in paraffin, breast tissues, blood. 



plasma, lymphatic fluid, lymphatic tissue, duct cells, ductal lavage fluid, nipple aspiration 
fluid and combinations thereof. Extraction may be by means that are standard to one skilled in 
thfe art, these include the use of detergent lysates, sonification and vortexing with glass beads. 
Once the nucleic acids have been extracted the genomic double stranded DNA is used in the 
analysis. 

In one embodiment the DNA may be cleaved prior to the next step of the method, this may be 
by any means standard in the state of the art, in particular, but not limited to, with restriction 
endonucleases. 

In the second step of the method, the genomic DNA sample is treated in such a manner that 
. cytosine bases which are unmethylated at the 5 '^position, are converted to uracil,. thymine, or 
another base which is dissimilar to cytosine in terms of hybridisation behaviour. This will be 
understood as "pretreatment" or "chemical pretreatment" hereinafter. 

The above described treatment of genomic DNA is preferably carried out with bisulfite (sul- 
fite, disulfite) and subsequent alkaline hydrolysis which results in a conversion of non- 
methylated cytosine nucleobases to uracil or to. another base which is dissimilar to cytosine in 
terms of base pairing behaviour. If bisulfite solution is used for the reaction, then an addition 
takes place at the non-methylated cytosine bases. Moreover, a denaturating reagent or solvent 
as well as a radical interceptor must be present. A subsequent alkaline hydrolysis then gives 
.rise to the conversion of non-methylated cytosine nucleobases to uracil. The converted DNA 
is then used for the detection of methylated cytosines. 

Fragments of the pretreated DNA are amplified, using sets of primer oligonucleotides, and a 
preferably heat-stable, polymerase. Because of statistical and practical considerations, pref- 
erably more than six different fragments having a length of 100 - 2000 base pairs are ampli- 
fied. The amplification of several DNA segments can be carried out simultaneously in one 
and the same reaction vessel. Usually, the amplification is carried out by means of a polymer- 
ase chain reaction (PCR). 

The design of such primers is obvious to one skilled in the art. These should include at least 
two oligonucleotides whose sequences are each reverse complementary or identical to an at 
least 18 base-pair long segment of the following base sequences specified in the appendix: 




SEO ID NO 6 to 26. Said primer oligonucleotides are preferably characterised in that they do 
n „t con** any CpG dinueieoudes. In a particuUrly preferred embodiment of the method, the 
seq uence of said primer otigonucleotides are designed so as .to selective* anneal <o and am- 
Jy only the breast ceU specific DNA of interest, thereby minimising the amplrficaUon of 
llground or non relevant DNA. In the context of me present invention, background DNA ts 
taken to mean genomic DNA which does no. have a relevant tissue specific methylahon pat- 
tern, in this case, the relevant tissue being breast tissues. 

According to the present invention, it is preferred that at leas, one primer oligonucleotide is 
b01m d to a solid phase during amplification. The different oligonucleotide and/or PNA- 
oUgomer sconces ctm be arranged on a plane sotid phase in the form of a reotanguta or : 
hexagonal lattice, the sotid pta* surface preferably being composed of silicon, glass, polysty- 
1 aluminium, stee,, iron, copper, nickel, silver, or gold, it being possime for other materr- 
als such as nitrocellulose or plastics to be used as well. 

The ftagments obtained by means of the amplification may carry a dixectly or mdixeotty de- 
tectable label. Preferred axe Ubels inthe form of fluorescence labels, radionuchdes, or detach- 
■ ewemolecmefm^havmga^icmmasswmohcanb.de.eetedinamassspectromet^ 

i, being preferred that the fragments that are produced have a single posihve or negative net 
charge for better detectabitity in me mass spectrometer. The detection may be earned out and 
vi^Oised by means of matrix assisted laser desorption/ionisation mass spectrometry 
(MALDI) or using electron spray mass spectrometry (ESI). 

•ft in the next step me nucleic acid amplifies are analysed in order to determine the methyla- 
tion status of the genomic DNA prior to treatment 

. The post treatment analysis of the nucleic acids may'be carried out using alternative method. 
Several methods for the memylation status speoifib analysis of me treated nuderc actds are 

The analysis may be carried out during the amplification step, of tire me^od. In one ^^em- 
bodiment, me memylation stems of presented CpO positions witiun tire gene, ESR1, APC 
HSD174B4, HIC, and RASSF1A and/or titeir regulatory regions may be detected by uae o 
motivation specific primer oligonucleotides. The term «MSP" (Methylat.on-spec.fic PGR) 





refeIS ,0 the art-recognized ***** assay described by Herman e, al. Pro, Ml Aca* . 
Sci USA 93:9821-9826, 1996, and a!so disclosed in US Patents No. 5,786,146 and No. 
6 265 171 The use of methylation status specific primers for the amplification of bisulpmte 
tteaed DNA atiows the differentiation between methylated and unmefoylated nucleic acids. 
MSP primers pairs contain a. tat one primer which hybridises to a bisulphite treated CpG 
dinucleotide. Therefore tire sequence of said primers comprises at least on. CO, TO or CA 
dinudeotide. MSP primers specific for non mediated DNA contain a T at the 3' posttion of 
«. C position in the CpG. According to the present invention, it is therefore preferred that the 
base sequence of said primers is required to comprise a sequence having a length of at least 10 
nudeotides which hybridises to a predated nucleic acid sequence according to SEQ D> 
NOS.: 6 to 26 and sequences complementary thereto wherein the base sequence of sard oh- 
gomers comprises at least one CO, TG or C A dinucleotide. 

to one embodiment of tire mettrod the methylation status of the CpG positions may be : deter- 
mined by means of hybridisation analysis, tothis embodiment of the mefood tire amphficates 
obtained in tire second step of the memod are hybridised to an array or a se, of oligonucleo- 
tides and/or PNA probes, to mis context, tire hybridisation takes place in foe manner de- 
scribed as follows. The set of probes used during the hybridisation is preferably composed of 
a, least 4 oligonucleotides or PNA-otigomers. In tire process, the amplifroa.es serve as probes 
which hybridise to oligonucleotides previously bonded to a solid phase. The non-hybndrsed 
fragments are subsequently removed. Said oligonucleotides contain at least one base sequence 
having a lengtir of 10 nucleotides which is reverse complementary or identical to a segment of 
me base Sequences specified in tire appendix, the segment containing at least one CpG or TpG 
dinucleotide. In a further preferred embodiment tire oytosme of tire CpG dinudeotide, or » 
foe case of TpG, foe thiamme, is foe 5* to 9* nucleotide from foe 5'-end of foe 10-mer. One 
oligonucleotide exists for each CpG or TpG dinucleotide. 

The non-hybridised amplifies are foen remove* to foe final step of foe mefoo d foe hy- 
bridised amputates are detected. In fois context, it is preferred that labels attaphed to foe 
amplificates are identifiable at each position of foe solid phase a, which an oligonucleotide 
sequence is located. 

to a preferred embodiment of foe mefood foe mefoylation status of foe CpG positions maybe 
determtoed by means of oligonucleotide probes that are hybridised to foe treated DNA eon- 





currently with the PCR amplification primers (wherein said primers may either be methyla- 
• tion specific or standard). 

A particularly preferred embodiment of this method is the use of fluorescence-based Real 
Time Quantitative PCR (Heid et al., Genome Res. 6:986-994, 1996) employing a dual- 
labelled fluorescent oligonucleotide probe (TaqMan™ PCR, using an ABI Prism 7700 Se- 
quence Detection System, Perkin Elmer Applied Biosystems, Foster City, California). The 
TaqMan™ PCR reaction employs the use of a nonextendible interrogating oligonucleotide, 
called a TaqMan™ probe, which is designed to hybridise to a GpC-rich sequence located 
between the forward and reverse amplification primers. The TaqMan™ probe further com- 
prises a fluorescent "reporter moiety" and a "quencher moiety" covalently bound to linker 
moieties (e.g., phosphoramidites) attached to the nucleotides of the TaqMan™ oligonucleo- 
tide. For analysis of methylation within nucleic acids subsequent to bisulphite treatment it is 
' required that the probe be methylation specific, as described in U.S. 6,33 1 ,393, also known as 
the Methyl Light assay. Variations on the TaqMan™ detection methodology that are also 
suitable for use with the described invention include the use of dual probe technology 
' (Lightcycler™) or fluorescent amplification primers (Sunrise™ technology). Both these tech- 
niques may be adapted in a manner suitable for use with bisulphite treated DNA, and moreo- 
ver for methylation analysis within CpG dinucleotides. 

A further suitable method for the use of probe oligonucleotides for the assessment of meth- 
ylation by analysis of bisulphite treated nucleic acids is the use of blocker oligonucleotides. 
The use of such oligonucleotides has been described by D. Yu, M. Mukai, Q. Liu, C. Stein- 
man in BioTechniques 23(4), 1997, 714-720. Blocking probe oligonucleotides are hybridised 
to the bisulphite treated nucleic acid concurrently with the PCR primers. PCR amplification of 
the nucleic acid is terminated at the 5' position of the blocking probe, thereby amplification of 
a nucleic acid is suppressed wherein the complementary sequence to the blocking probe is 
present. The probes may be designed to. hybridise to the bisulphite treated nucleic acid in a 
methylation status specific manner. For example, for detection of methylated nucleic acids 
within a population of unmethylated nucleic acids suppression of the amplification of nucleic 
acids which are unmethylated at the position in question would be carried out by the use of 
blocking probes comprising a 'CG' at the position in question, as opposed to a 'CA. 



For PCR methods using blocker oligonucleotides, efficient disruption of polymerase-mediated 
amplification requires that blocker oligonucleotides not be elongated by the polymerase. Pref- 
erably, this is achieved through the use of blockers that are 3'-deoxyoUgonucleotides, or oli- 
gonucleotides derivatised at the 3' position with other than a "free" hydroxyl group. For ex- 
ample, 3'-0-acetyl oligonucleotides are representative of a preferred class of blocker mole- 



exile. 





Additionally, polymerase-mediated decomposition of the blocker oligonucleotides should be 
precluded. Preferably, such preclusion comprises either use of a polymerase lacking 5'-3' 
exonuclease activity, or use of modified blocker oligonucleotides having, for example, thioate 
bridges at the 5'-terminii thereof that render the blocker molecule nuclease-resistant Particu- 
lar applications may not require such 5' modifications of the blocker. For example, if the 
blocker- and primer-binding sites overlap, thereby precluding binding of the primer {e.g., with 
excess blocker), degradation of the blocker oligonucleotide will be substantially precluded. 
This is because the polymerase will not extend the primer toward, and through (in the 5'-3' 
direction) the blocker - a process that normally results in degradation of the hybridized 
blocker oligonucleotide. 

A particularly preferred blocker/PCR embodiment, for purposes of the present invention and 
as implemented herein, comprises the use of peptide nucleic acid (PNA) oligomers as block- 
ing oligonucleotides. Such PNA blocker oligomers are ideally suited, because they are neither 
decomposed nor extended by the polymerase. 

Preferably, therefore, the base sequence of said blocking oligonucleotides is required to com- 
prise a sequence having a length of at least 9 nucleotides which hybridises to a pretreated nu- 
cleic acid sequence according to one of SEQ ID NOs: 6 to 26 and sequences complementary 
thereto, wherein the base sequence of said oligonucleotides comprises at least one CpG, TpG 
or CpA.dinucleotide. 

In a further preferred embodiment of the method the detennination of the mediation status 
of the CpG positions is carried out by the use of template directed oligonucleotide extension, 
such as MS SNuPE as described by Gonzalgo and Jones (Nucleic Acids Res. 25:2529-2531). 





In a further embodiment of the method the determination of the methylation status of the CpG 
positions is enabled by sequencing and subsequent sequence analysis of the amplificate gen- 
erated in the second step of the method (Sanger F., et al., 1977 PNAS USA 74: 5463-5467). 

The method according to the invention may be enabled by any combination of the above 
means. In a particularly preferred mode of the invention the use of real time detection probes 
is concurrently combined with MSP and/or blocker oligonucleotides. 

A further embodiment of the invention is a method for the analysis of the methylation status 
of genomic DNA without the need for pretreatment. In the first and second steps of the 
method the genomic DNA sample must be obtained and isolated from tissue or cellular 
sources. Such sources may include cell lines, histological slides, biopsies, tissue embedded m 
- paraffin, breast tissues, blood, plasma, lymphatic fluid, lymphatic tissue, duct cells, ductal 
lavage fluid, nipple aspiration fluid and combinations thereof. Extraction may be by means 
that are standard to one skilled in the art, these include the use of detergent lysates, sdnifica- 
tion and vortexing with glass beads. Once the nucleic acids have been extracted the genomic 
double stranded DNA is used in the analysis. 

In a preferred embodiment the DNA may be cleaved prior to the treatment, this may be by 
any means standard in the state of the art, in particular with restriction endonucleases. In the 
third step, the DNA is then digested with one or more methylation sensitive restnction en- 
zymes. The digestion is carried out such that hydrolysis of the DNA at the restriction site is 
informative of the methylation status of a specific CpG dinucleotide. 

In a preferred embodiment the restriction fragments are amplified. In a further preferred em- 
bodiment this is carried out. using the polymerase chain reaction. 

In the final step the amplifiers are detected. The detection may be by any means standard in 
the art' for example, but not limited to, gel. electrophoresis analysis, hybridisation analysis, 
incorporation of detectable tags within the PGR products, DNA array analysis, MALDI or 
ESI analysis. 

The aforementioned method is preferably used for ascertaining genetic and/or epigenetic pa- 
rameters of genomic DNA. 



In order to enable this method, the invention further provides the modified DNA of one or a 
combination of genes takenfrom to group ESR1, APC, HSD174B4, HIC1 and RASSF1A as 
well as oligonueleotides and/or PNA-oligomers for deteeting eytosine methylations wrtlun 
said genes. The present invention is based on to discovery tot genetie and ep.genenc pa- 
meters and, in particular, to eytosine mediation patterns of said genomic DNAs are par- 
ticularly suitable for improved treatment and monitoring of breast ceU proliferative disorders. 

Tfre nucleic acids according to the present invention can be used for to analysis of genetic 
and/or epigenetic parameters of genomic DNA. 

In another aspect of to present invention, to object of to present invention is achieved us- 
ing a nucleic acid containing a sequence of at least 18 bases in length of to pretreated geno- 
mic DNAaccording to one of SBQ ID NOs: 6 to 25 and sequences -complementary thereto. 

The modified nucleic acids could heretofore not be connected with the ascertainment of dis- 
ease relevant genetic and epigenetic parameters. 

The object of to present invention is further achieved by an oligonucleotide or oligomer for 
the analysis of pretreated DNA, for detecting the genomic eytosine methylation state, sam 
oligonucleotide containing at least one base sequence having a length of at least 10 nucleo- 
tides which hybridises to a pretreated genomic DNA according to SEQ ID Nos: 6 to 26. The 
oligomer probes according to to present invention constitute, important and effective tools 
which, for to first time, make it possible to ascertain specific genetic and epigenetic parame- 
ters dnring to analysis of biological samples for featn.es associated with, a patient's response 
* endocrine treatment. Said oligonucleotides allow to improved treatment and monitonng of 
breast cell proUfemtive disorders. The base sequence of to oligomer preferably contains a, 
lens, one CpG or TpG dinncleotide. The probes may also exist in the form of a PNA (peptide 
nucleic acid) which has particularly preferred pairing properties. Particularly preferred are 
oligonucleotides according to the present invention in which the eytosine of to CpG dmu- 
cleotide is within to middle third of said oligonucleotide e.g. to 5* - 9* nucleotide from 
the 5'-end of a 13-mer oligonucleotide; or in the case of PNA-oligomers, it is preferred for to 
eytosine of to CpG dinucieotide to be to 4* - 6* nucleotide from to 5'-end of to 9-mer. 




1^ 



The oligomers aceording ,0 <he present invention are normally used in so called "sera" which 
contain at least two oligomers and .op to one oligomer for each of the CpG dinucleotidea 
vvithin SEQ ID NOs: 6 to 26. 

In the case of the sets of oligonucleotides according to the present invention, it is preferred 
that at least one oligonucleotide is bound to a solid phase. It is father preferred that all the 
oligonucleotides of one set are bound to a solid phase. 

The present invention further relates to a set of at least 2 n (oUgonucleotides and/or PNA- 
oligomers) used for detecting the cytosine methylation state of genomic DNA, by analysis of 
sain sequence or treated versions of said sequence (of the genes ESRl, APC, HSD174B4, 
ffldl and RASSF1A, as detailed in the sequence listing and Table l)and sequences comple- 
mentary thereto). THese probes enable improved treatment and monitoring of breast cell pro- 
liferative disorders. 

Tie set of oligomers may also be used for detecting single nucleotide polymorphisms (SNPs) 
by analysis of said sequence or. treated versions of said sequence of the genes ESRl, APC, 
HSD174B4.HIC1 andRASSFlA. 

I, will be obvioua to one skilled in the art that the method according to the invention will be 
improved and supplemented by the incorporation of markers and clinical indicators known » 
the state of tire art and currentiy used as diagnostic or prognostic marker, More preferably 
said markers include node status, age, menopausal status, grade, estrogen and progesterone 
receptors. 

The genes that form the basis of the present invention may be used to form a "gene panel", i.e. 
a collection comprising the particular genetic sequences of tire present invention and/or therr 
respective informative methylation sites. The formation of gene panels allows for a qutck and 
■ specific analysis of specific aspects of breast cancer treatment. Tne gene panel(s) ¥ descnbed 
and employed in this invention can be used with surprisingly high efficiency for the treatment 
of breast cell proliferative disorders by prediction of the outcome of treatment with a therapy 
comprising one or more drugs which Urge, tire eatiogen receptor pathway or are involved » 
■ esnogen metabolism, production, or secretion. The analysis of each gene of the panel contnb- 
utes to tire evaluation of patient responsiveness, however, in a less prefered embodunent tire 



patient evaluation may be achieved by analysis of only a single gene. The analysis of a single 
member of the 'gene panel' would enable a cheap but less accurate means of evaluating pa- 
tient responsiveness, the analysis of multiple members of the panel would provide a rather 
more expensive means of carrying out the method, but with a higher accuracy (the technically 
preferred solution). 

According to the present invention, it is preferred that an arrangement of different oligonu- 
cleotides and/or PNA-oligomers (a so-called "array") made available by the present invention 
is present in a manner that it is likewise bound to a solid phase. This array of different oligo- 
nucleotide- and/or PNA-oligomer sequences can be characterised in that it is arranged on the 
solid phase in the form of a rectangular or hexagonal lattice. The solid phase surface is pref- . 
erably composed of silicon, glass, polystyrene, durninium, steel, iron, copper, nickel, silver, 
■ or gold. However, nitrocellulose as well as plastics such as nylon which can exist in the form 
of pellets or also as resin matrices are suitable alternatives. 

Therefore, a further subject matter of the present invention is a method for manufacturing an 
, array fixed to a carrier material for the improved treatment and monitoring of breast cell pro- 
liferative disorders. In said method at least one oligomer according to the present invention is . 
coupled to a solid phase. Methods for manufacturing such arrays are known, for example, 
' from US Patent 5,744,305 by means of solid-phase chemistry and photolabile protecting 
groups. 

A former subject matter of the present invention relates to a DNA chip for theimproved 
treatment and monitoring of breast cell proliferative disorders. The DNA chip contains at least 
one nucleic acid according to the present invention. DNA chips are known, for example, m 
US Patent 5,837,832. „. 

Moreover, a subject matter of the present invention is a kit which may be composed, for ex- 
ample, of a bisulfite-containing reagent, a set of primer oligonucleotides containing at least 
two oligonucleotides whose sequences in each case correspond to or are complementary to a 
18 base long segment of the base sequences specified in SEQ ID.NOs: 6 to 26 and/or PNA- 
oligomers as well as instructions for carrying out and evaluating the described method. 



In a further preferred embodiment said kit may further comprise standard reagents for per- 
forming a CpG position specific methylation analysis wherein said analysis comprises one or 
more of the following techniques: MS-SNuPE, MSP, Methyl light, Heavy Methyl, and nu- 
cleic acid sequencing. However, a kit along the lines of the present invention can also contain 
only part of the aforementioned components. • 

Typical reagents {e.g., as might be found in a typical MethyLightO-based kit) for Meth- 
yLight® analysis may include, but are not limited to: PCR primers for specific gene (or meth- 
ylation-altered DNA sequence or CpG island); TaqMan® probes; optimized PCR buffers and 
deoxynucleotides; and Taq polymerase. 

Typical reagents (e.g., as might be found in a typical Ms-SNuPE-based kit) for Ms-SNuPE 
analysis may include, but are not limited to: PCR primers for specific gene (or methylation- 
altered DNA sequence or CpG island); optimized PCR buffers and deoxynucleotides; gel ex- 
traction kit; positive control primers; Ms-SNuPE primers for specific gene; reaction buffer 
(for the Ms-SNuPE reaction); and radioactive nucleotides. Additionally, bisulfite conversion 
reagents may include: DNA denaturation buffer; sulfonation buffer; DNA recovery regents or 
kit (e.g., precipitation, ultrafiltration, affinity column); desulfonation. buffer; and DNA recov- 
ery components. 

Typical reagents (e.g., as might be found in a typical MSP-based kit) for MSP analysis may 
include, but are not limited to: methylated and unmethylated PCR primers for specific gene 
(or methylation-altered DNA sequence or CpG island), optimized PCR buffers and deoxynu- 
cleotides, and specific probes. 

The oligomers according to the present invention or arrays thereof as well as a kit according 
to the present invention are intended to be used for the improved treatment and monitoring of 
breast cell proliferative disorders. According to the present invention, the method is prefera- 
bly used for the analysis of important genetic and/or epigenetic parameters within genomic 
DNA, in particular for use in improved treatment and monitoring of breast cell proliferative 
disorders. 



The methods according to the present invention are used, for improved detection, tr< 
and monitoring of breast cell proliferative disorder. - • 



The present invention moreover relates to the diagnosis and/or prognosis of events winch are 
disadvantageous or relevant to patients or individuals in which important genetic and/or epx- 
genetic parameters within genomic DNA, said parameters obtained by means of the present 
invention may be compared to another set of genetic and/or epigenetic parameters, the differ- 
ences serving as the basis for the diagnosis and/or prognosis of events which are disadvanta- 
geous or relevant to patients or individuals. 

In the context of the present invention the term "hybridisation" is to be understood as a bond, 
of an oligonucleotide to a completely complementary sequence along the lines of the Watson-. 
Crick base pairings in the sample DNA, forming a duplex structure. 

In the context of the present invention, "genetic parameters" are mutations and polymor- 
phisms of genomic DNA and sequences further required for their regulation. To be designated 
as mutations are, in particular, insertions, deletions, point mutations, inversions and polymor- 
phisms and, particularly preferred, SNPs (single nucleotide polymorphisms). 

In the context of the present invention the term "mediation state" is taken to mean the de- 
gree of methylation present in a nucleic acid of interest, this may be expressed in absolute or 
relative terms i.e. as a percentage or other numerical value or by comparison to anomer tissue 
and therein described as hypermethylated, hypomethylated or as having significantly Similar 
or identical methylation status. 

In the context of the present invention the term "regulatory region" of a gene is taken to mean 
nucleotide sequences which affect the expression of a gene. Said regulatory regions may be 
located within, proximal or distal to said gene. Said regulatory regions include but are not 
limited to constitutive promoters, tissue-specific promoters, developmental-specific promot- 
ers inducible promoters and the like. Promoter regulatory elements may also include certain 
enhancer sequence elements that control transcriptional or translational efficiency of the gene. 

In the context of the present invention the term "chemotherapy" is taken to mean the use of 
drugs or chemical substances to treat cancer. This definition excludes radiation therapy 
(treatment with high energy rays or particles), hormone therapy, (treatment with hormones or 
hormone analogues (synthetic substitutes) and surgical treatment. 



In the context of the present invention, "epigenetic parameters" are, in particular, cytosme 
m ethylations and further modifications of DNA bases of genomic DN A and sequences further 
required for their regulation. Further epigenetic parameters include, for example, the acetyla- 
tion of histones which, cannot be directly analysed using the described method but winch, m 
"turn, correlates with the DNA methylation. 

In the following, the present invention will be explained in greater detail on the basis of the 
sequences, figures and examples, without being limited thereto. 

Figure 1 shows the Kaplan-Meier estimated overall survival curves for the gene APC, for a 
set of 86 breast cancer patients. The dotted line (upper curve) shows unmethylated samples 
whereas the unbroken line (lower curve) shows methylated samples. The x-axis shows the 
number of years, and the Y-axis shows the proportion of the group. 

Figure 2 shows the Kaplan-Meier estimated overall survival curves for the gene RASSFIA, 
for a set of 86 breast cancer patients. The dotted line (upper curve) shows unmethylated sam- 
ples whereas the unbroken line (lower curve) shows methylated samples. The x-axis shows 
the number of years, and the Y-axis shows the proportion of the group. 

Figure 3 shows the combined Kaplan-Meier estimated, overall survival curves for the genes 
APC and/or RASSFIA, for a set of 86 breast cancer patients. The dotted line (upper curve) 
shows unmethylated samples whereas the unbroken line (lower curve)shows methylated- 
samples. The x-axis shows the number of years, and the Y-axis shows the proportion of the 



group: 



SEQ ID NOs: 1 to 5 represent 5' and/or regulatory regions and/or CpO rieh regions of the 
genes according to Table 1. These sequences are derived fain Genbank and will be taken to 
include all minor variations of the sequence material which are currently unforeseen, for ex- 
ample, but not limited to, nrinor deletions and SNPs. 

SEQ ID NOs: 6 to 26 exhibit tire preheated sequence of DNA derived tan the genomic se- 
. quence according to Table. 1. These sequences will be taken to include aU minor variations of 



the sequence material which are currently unforeseen, for example, but not limited to, minor 
deletions and SNPs. 

Examples 

Using MethyLight, a high-throughput DNA mediation assay, the inventors analysed 3? 
genes in a gene evaluation set, consisting of ten sera from metastasised patients, 26 patients 
with primary breast cancer and ten control patients. In order to determine the prognostic value 
of genes identified within the gene evaluation set, the inventors finally analysed pretreatment 
sera of 24 patients having had no adjuvant treatment (training set) to deterrnine their prognos- 
tic value. An independent test set consisting of 62 patients was then used to test the validity of. 
genes and combinations of genes, which in the training set were found to be good prognostic 
markers. 

In the gene evaluation set the inventors identified five genes (ESR1, APC, HSD17B4, HIC1 
and RASSF1A). In the training set, patients with methylated serum DNA for RASSF1A 
and/or APC had the worst prognosis (p < 0.001). This finding was confirmed by analysing 
serum samples from the independent test set (p = 0.007). When analysing all 86 investigated 
patients, multivariate analysis showed methylated RASSF1A and/or APC serum DNA to be 
independently associated with poor outcome, with a relative risk for death of 5.7. DNA meth- 
ylation. of particular genes in pretherapeutic sera of breast cancer patients, especially of 
RASSF1A/APC, is more powerful than standard prognostic parameters. 

The gene evaluation set consisted of patients with recurrent disease (n=10; sera obtained at 
diagnosis of metastasis in the bone, lung, brain or liver) and pretherapeutic sera of recently 
diagnosed primary breast cancer patients (n=26; age range: 36.1 yrs to 83.9 yrs. (mean: 59.3 
yrs )• two 18 and six patients had pTl, pT2 and P T3 cancers, respectively; 15, ten and one 
patients had lymph node-negative, - positive and unknown disease, respectively) and normal 
controls (n=10; age range: 20.5 to.7i:5 yrs. (mean: 44.6 yrs.); all underwent a core biopsy and 
were confirmed to have benign disease of the breast). 

To assess prognostic significance the inventors used pretherapeutic sera in independent train- 
ing (n=24) and test (n=62) sets consisting of patients who did not receive any adjuvant treat- 
ment after surgery, 



23 4. 



Systemic adjuvant therapy was either not necessary or the patients were not ebgtble or refused 
Jy further treatment. The primary surgical procedure included breast-conservmg lumpec- 
J my or modified radical mastectomy and Hilary lymph node dissection. Medran age of *e 
J, popuiation was 60 years (range, 28 to 86 yr,). After a median follow-up o 3^ 

range o- -nmto !2.2 yr,) 17 of me 86 patients (20 N) had died. Disttibution of a ermnt 
Tl DNA mediation of the 86 patients and association with clinical and hrstopathoiogrcal 

characteristics are shown in Table 2. 

patterns' b,ood samples were drawn prior to therapeutic intervention The blood was centti- 
Ld a, 2000 g for 10 min a. room temperature, and 1 mL aliquot of semm samples wer 
^ a. -30°C Genomic.DNA from serum samples washed using me High-Pure V-m, 
Nucleic Acid Kit (Roche Diagnostics, Mannheim, Germany) according to tire manufacturer s 
protooo. with some modifications for multiple loading of the DNA extraction columrn ,* . gam 
a sufficient amount of DNA. Thus, 4 x 200pJ of a serum sample were each mrxed wrti, 20 pi 
working solution (binding buffer supplemented with polyA carrier KNA) and 5 0pl prote> 
nase K [1 mg/ml] and incubated for 10 minutes a* 72°C. After adding lOOu. rsopropanol *e 
72™ lei, loaded onto the action colunm and centtimged for 1 mmute a, 8000, 
The flow-through was pipetted back into the same column reservoir and centred a second 
toe This precis wss repeated four times for each serum sample. After these « 
ste ps» me DNA isolation was processed as described in me manufacturer s protocol. Fo 
DNA elution 55pl of AE-buffer (Quiagen, CA, USA), were added, incubated for 20 mm a 
' 45«C and centiifuged for three minutes at.12.000g. For bom, norma! sera and cancer sera 
analysis the same amount of serum for DNA extraction was used. 

Sodium bisuffite conversion of genomic DNA was performed as described previously (Eads 
et al, MethyLight. a high-throughpu, assay to measure DNA memylatton. Nuclerc Acrds 
Res., 28: E32, 2000). 

Sodium bisulfi.e-treated genomic DNA was analysed by means of the MethyUght, a fluores- 
oence-based, real-time FOR assay, as described previously (Eads et al 2000, 
et al, Epigenetic patterns in the progression of esophagenl adendc~ Cancer to^. 
3410-3418, 2001). Briefly, two sets of primers and prob*, designed speedy for b.sn,fite- 
converted DNA, were use4: a methylated se, for tire gene of interest and a reference set, B, 




actin (ACTS), to normalise for input DNA. Servun samples of patients with recurrent disease 
revealed to highest tunoun, of B-actin, whereas no difference between B-actin values ffonr 
ser um samples of patients with primary breast cancer and sera.of nonna! controls was ob- 
5 erved Specificity of me reactions for methylated DNA was confirmed separately usrng tol 
(New England Biolabs)-treated human white blood cell DNA (heavily methylated). The per- 
centage of fully methylated molecules at a specific .ecus was calculated by dWiding to 
GENE-ACTB ratio of a sample by to GENEACTB ratio of M-treated white blood cell DNA 
and multiplying by 100. The abbreviation PMR (fiercenfage of fully methylated reference) 
indicates mis measurement For each MemyLight reaction lOul of bisnlfite-treated genomrc 
DNA was used. 

A gene was deemed memylated if the PMR value was > O.Primer and probes specific for 
methylated DNA and used for MemyLight reactions are listed in Supplemental Data. 

The inventors used Pearsons Chi' or - in to case of low fluencies per cell - Fisher's exact 
me tod ,o test associations between categorically ciinicopatho.ogical features. The Mann- 
■ Whitney-U-Test was used to assess differences between non-parametric distributed vanables. 
Overall survival was calculated from the date of diagnosis of to primary tiunour to to date 
of death or las. follow-up. Overa.1 survival curves were calculated with to Kaplan-Merer 
method! Univariate analysis of overall survival according to clinicopathologrcal factors On- 
tologies! type, ttrmoux stage, nodal status, grading, menopausal status, hormone receptor 
status (estrogen and/or progesterone receptor positivity), estrogen and progesterone receptor 
status) and gene methylation were performed using a two-sided log-rank test 

Multivariate Cox proportional hazards analysis was used to estimate to prognostic effect of 
methylated genes. 

A p value < 0.05 was considered a statistically sipufican, difference. All statistics! analyses 
were performed using SPSS Software 10.0. . . - 

The inventors initiaUy investigated 39 genes in to sera of ten patients with metasfcsised 
breast career for to presence of aberrant metirylation. The 33 genes positive in to sera of to 
metastasised patients wem fwtor evaluated in an independent sample set of pretorapeutic 
sera of 26 patients with primary breast cancer and ten healthy consols. An overvrew of to 



frequency of methylation in the investigated serum samples is given in Table 3. The most 
appropriate genes for our further analyses were determined to' be those that met one of the 
following criteria: (i) unmethylated in serum samples from healthy controls and > 10% meth- 
ylated in serum samples from primary breast cancer patients, or (ii) £ 10% methylated in se- 
rum samples from healthy controls and >20% methylated in serum samples from primary 
breast cancer patients. A total of five genes, namely ESR1, APC, HSD17B4, HIC1 and 
RAS SF 1 A, met at least one of these criteria (Table 3). 

Pre-treatment serum samples from patients included in the training set were used to evaluate 
the prognostic value of the methylation status of these five genes. In this training set the in- 
ventors' identified ESR1, APC or RASSF1A methylation in primary breast cancer patients' 
sera to be markers of poor prognosis, whereas HSD17B4 reached only borderline significance 
and aberrant methylation of HIC1 showed no significant results (Table 4). Furthermore, vari- 
ous combinations of the investigated genes were analyzed. Patients were classified as meth- 
ylation-positive if at least one of the genes included in the combination showed aberrant 
methylation. Patients with methylated serum DNA for RASSF1A and/or APC had the worst 
prognosis (P < 0.001), even worse than when each gene was analysed individually (Table 4). 

The highly significant prognostic value for APC and/or RASSF1 A methylation in serum sam- 
ples from breast cancer patients was confirmed by analysing the test set (P = 0.007, log rank 
test). ESR1 and APC methylation as single genes or the combinations ESR1/RASSF1A and 
ESR1/APC no longer had prognostic significance (Table 4). . 

Combined analysis of the training and test sets (n=86) showed correlation between ESR1 and 
RASSF1A (P = 0.005) and between ESR1 and APC (P = 0.031), whereas no correlation was 
observed between RASSF1A and APC: In patients, with advanced tumours RASSF1A and 
ESR1 methylation and in patients with progesterone receptor-negative tumours APC meth- 
ylation was more prevalent in pretherapeutic sera, while no further associations were seen 
between clinicopathological features and DNA methylation of APC, ESR1 or RASSF1 A. (Ta- 
ble 5). RASSF1A methylation in pretherapeutic sera was more prevalent in older than in 
younger patients, whereas age had no effect on DNA methylation of ESR1 or APC. 

Univariate analysis of all 86 investigated patients (training set plus test set) revealed prognos- 
tic significance for tumour size, lymph node metastases and methylation status of APC, 



RASSF1A and the combination of RASSF1A/APC (Table 6; Fig. 1). Due to the fact that 
ESR1 mediation correlates with APC as well as with RASSF1A mediation, the inventors 
did not test the triple combination in the univariate or the multivariate analyses of all 86 pa- 



tients. 



The Cox multiple-regression analysis included tumour size, lymph node metastases, age and 
mediation status of the investigated genes. Beside lymph node status, methylated RASSF1A 
and/or APC serum DNA was strongly associated with poor outcome, with a relative risk for 
death of 5.7 (Table 7). 

Prognosis in patients with newly diagnosed breast can6er is determined primarily by the pres- 
ence or absence of metastases in draining axillary lymph nodes. Nevertheless, the life- 
threatening event in breast cancer is not lymph node metastasis per se, but haematogenous 
metastases which mainly affect bone, liver, lung and brain. The inventors therefore aimed to 
• develop a prognostic test that is sensitive for haematogenous metastases and could be per- 
formed in patients' pretherapeutic serum. 

In recent years several studies have reported cell-free tumour-specific DNA in serum/plasma 
of breast cancer patients at diagnosis. Aberrant methylation of serum/plasma DNA of patents 
with various types of malignancies, including breast cancer, has been described (see above). 

In light of these observations, the inventors examined the methylation status of 39 genes, 
which on the one hand, are known to be frequently methylated in breast cancer and other 
malignancies (Jones and Baylin: The fundamental role of epigenetic events in cancer. Nat. 
Rev Genet 3- 415-428, 2002; Widschwendter and Jones: DNA methylation and breast car- 
cinogenesis' Oncogene, 21: 5462-5482, 2002) and, on the other hand, were reported to be 
abnormally regulated in tumours of patients with poor prognostic breast cancer (van't Veer et 
al • Gene expression profiling predicts clinical outcome of breast cancer. Nature, 415: 530- 
536 2002- van de Vijver et al.: A gene-expression signature as a predictor of .survival m 
brelst cancer. N. Engl. J. Med., 347: 1999-2009, 2002) Because levels of circulating DNA m 
metastasised patients are known to be higher (Leon et al. Free DNA in the serum of cancer 
patients and the effect of therapy Cancer Res., 37: 646-650, 1977) and because the loss of 
genetic heterogeneity of disseminated tumour cells with the emergence of clinically evident 
metastasis was recently reported (Klein et al!: Genetic heterogeneity of single disseminated 





nnnour cells in minimal residual cancer. Lancet, 360: 683-689, 2002), the inventors firstly 
investigated these 39 genes in ten sera of metastasis* patients to determine fire overall 
prevalence of mediation changes in hreast cancer. As a next step the inventors analysed fir. 
33 genes that were positive in the metastasised patients, in the pre-treatment sera of 26 pa- 
tients with primary breast cancer and in ten benign controls in order to. identify fire most rm- 
portant genes for further analysis. Eventually the inventors came up with five genes (ESR1, 
APC HSD17B4 MCI and RASSF1A), which were primarily analysed in a group of 24 pa- 
tients (training set). To confirm the significance of this result the inventors tested these genes 
in an independent set of 62 patients (test set). In order to apply the strictest criteria for testing 
fire potential of a prognostic factor, the inventors investigated these markers in women, who 
had not undergone adjuvant systemic treatment. DNA methylation of APC and RASSF1A m 
pretherapeutio sera, both frequently methylated and abnormally regulated in human prrmary 
breast cancers (Dammann et al, Hypermethylation of the cpG island of Ras association do- • 
.nain family 1A (RASSF1A), a putative tumor suppressor gene from fire 3p21.3 locus, occurs 
in a large percentage of human breast cancers. Carreer Re,, 61: 3105-3109, 200!; Virmani et 
al • Aberrant methylation of fire adenomatous polyposis coli (APC) gene promoter 1A m 
breast and lung carcinomas. Clin. Cancer Res., 7: 1998-2004, 2001), turned out to be a strong 
independent prognostic parameter. These genes are involved in pathways counteracting me- 
tastasis' mediation of intercellular adhesion, stabilisation of the oytoskeleton, regulation of the 
cell cycle and apoptosis (Fearnhead et al,. The ABC of APC. Hum. Mol. Genet, 10: 721-733 
2001- Dammann et al, Epigenefic inactivation of fire Ras-association domarn faintly 1 
(RASSF1 A) gene and its function in human carcinogenesis. Histol. Histopathol., 1 8:. 665-677, 
2003) Methylated DNA in patients' prefirerapeutic serum coding for these two genes reflects 
poor prognosis. The source of fire tumour-specific DNA and its definite role in metastasrs 
remains elusive. Circulating tumour-specific altered genetic information may serve as a sur- 
rogate marker for circulating tumour cells that ultimately cause distant metastases. An alter- 
■ native, but equally attractive, hypothesis is that circulating altered DNA per se may cause de 
novo development of tumour cells in organs known to harbour breast cancer metastases. Tte 
so-called "Hypothesis of Genometastasis" suggests that malignant transformation mrght de- 
velop as a result of transfection of susceptible cells in distant target organs with dommant 
oncogenes mat circulate in fire plasma and are derived from the primary tumour. Interestingly, 
; irrespective of the source of DNA in the serum, it is noteworthy that some genes prov.de 
prognostic information when methylated in patients' sera, whereas genes like MCI, which rs 
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methylated in about 40% and 90% of primary and metastasised breast cancer patients, respec- 
tively, but in only 1 0% of healthy individuals, are not at all a prognostic parameter. 

Irrespective of the mechanistic role of methylated DNA with regards to metastasis in breast 
cancer patients, these epigenetic changes in serum have several advantages as indicators of 
poor prognosis as compared to currently used or studied prognostic parameters: DNA in se- 
rum is stable and can be analysed by a high-throughput method like MethyLight. Compared 
to bone marrow aspiration, a simple blood draw (which can be repeated any time throughout 
the follow-up period) is sufficient. The more screening mammographies are performed, the 
more small cancers are treated and after histopathological examination no tumour material 
will remain to perform RNA- and/or protein-based assays for risk evaluation. This application 
therefore demonstrates a useful and easy approach for risk assessment of breast cancer pa- 
. tients. 



Table 1 



Gene 


Genomic se- 
quence 


Bisulphite sequence 


HIC1 

NM 006497 


1 


6, 7, 16 and 17 


HSD17B4 
NM 000414 


2 


8, 9, 18 and 19 


APC 

NM 000038 


3 


10, 11, 20 and 21 


ESR1 

NM 000125 


4 


12, 13, 22 and 23 


RASSF1A 
NM 170715 


5 


14, 15, 24 and 25 



Table 2. Characteristics of training and test sets. 



Characteristics 


Training Set 
(N=24) 


Test Set 
(N=62) 


P 

Value* 




percent . 




Size of tumour . 






0.024 


Tl 


62.5 


79 




T2 


.37.5 


13 • 




T3+T4 


0 


7 




Histologic type . . 


67 


63 


n.s 


Invasive ductal 




Invasive lobular 


. 8 


13 




Others 


25 


24 . 





Tumor grade 




44 


n.s 


1 


46 




2 


33 


39 




3 


17 


10 




Lymph node metastases 




65. 


n.s.- 


No 


75 




Yes 


12.5 


1 1 
11 




Unknown 


12.5 


24 




Menopausal status 


33 


16 


n.s. 


Premenopausal 




Postmenopausal 


67 


O A 

84 


11. a « 


Estrogen-receptor status 




40 


Positive 


54 




Negative 


42 


45 


n.s. 


Progehsteron-receptor 






status 




45 




Positive 


58 




Negative 


38 




n.s. 


Hormone-receptor status 


63 


50 


Positive 




Negative 


33 


36 





*P values for the comparison of numbers of patients were calculated by means of the Chi 
test. n.s., not significant; Median age: training set (54.2 years; 37.6-83.2), test set (65.7 years; 
28.2-86.2), P = 0.052 ; Follow-up: training set (8.0 years; 1 month to 12.2. years), test set (3.1 
years; 1 month to 11. years) P < 0.001. 
Tumour grade was unknown in six cases. 
Hormone-receptor status was unknown in ten cases. 
Tumour size was unknown in one case. 

Table 3. Frequency of methylated serum DNA in the gene evaluation set. 



Gene 



Healthy Controls 
(N = 10) 



Primary Breast Can- 
cer 
(N = 26) 

percent positive 



ESRl 


0 


27 


APC 


0 


23 


HSD17B4 


0. 


12 


CDH13 


0 


.8- 


ESR2. 


. 0 


4 


MGMT 


0 


4 


SYK 


0 


4 


HICl 


10 


39 


RASSF1A 


10 


23 


GSTP1 


10 


12 


MYOD1 


20 


27 


CDH1 


20 


20 



Recurrent Breast Can- 
cer 
(N = 10) 



70 
80 
30 
40 
20 
10 
10 
90 
80 
60 
80 
90 



PTGS2 30 

PGR .30 

CALCA 40 

HLAG 60 

BLT1 60 

ARM 100 
MLLT7 . 100 

XFF1 100 

SOCS2 0 

SOCS1 0 

TERT 0 

DAPK1 0 

TIMP3 0 

BRCA1 0 

GSTM3 0 

MT3 0 

TWIST 0 

MLH1 0 

CYP1B1 0 

TITF1 0 

FGF18 0 
CDKN2A . n.d. 

HSPA2 n.d, 
PPP1R13 

B n.d. 

TP53BP2 n.d. 

REV3L n.d. 

IGFB2 

n.d., not done 



39 

46 

50 

69 

85 

100 

100 

100 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 
n.d. 
n.d. 

n.d. 
n.d. 
n.d. 
n.d. 



100 

80 

60 

100 

100 

100 

100 

100 

40 

30 

30 

30 

20 

20 

20 

20 

10 

10 

10 

10 

10 

0 

0 

0 
0 
0 
0 



Table 4. Univariate analysis of methylation status in training and test sets. 



Genes 



ESR1 

APC : 

HSD17B4 

HIC1 . 

RASSF1A 

RASSFIA/APC. 

ESR1/APC 

ESR1/RASSF1A 



Training Set 
. (N=24) 
P Value 

0.018 
0.002 
0.056 
0.796 
0.042 
. <0.001 
.0.001 
0.032 



Test Set 
(N=62) 
P Value 

0.555 
0.307 



0.014 
0.007 
0.951 
0.138 



*P values for each variable were calculated by means of the log rank test. 



Table 5. Frequency of methylated genes according to clinicopathological features. 




Characteristics 



Size of tumour 

Tl 

T2 

T3 + T4 
Histologic type 
Invasive ductal 
Invasive lobular 
Others 
Tumor grade 
1 
2 

3 - 

Lymph node metasta- 
ses 

No 

Yes 

Unknown 
Menopausal status 
Premenopausal 
Postmenopausal 
Estrogen-receptor 
Positive 
Negative. 
Progensterone- 
receptor 
Positive 
Negative 

Hormone-receptor 
status 
Positive 

Negative 



No. of 


ESR1 


APC 


RASSF1 


RASSF1 


Patients . 






A 


A and/or 








APC 




% positive 


64 . 


1 A 


i i 

1 x 


9 


19 


17 




12 


19 


31 


4 


75 


25 


50 


50 


55 


18 


15 


11 


22 


10 




0 


30 


30 


21 


1 n 


10 


10 


20 


38 


ii . 


11 


13 


21 


32 


1 Q 

1)7 


16 


16 


31 


10 


30 . 


10 


11 


11 


58 


12 


9 


9 


18 


io ; 


zu 


30 


20 . 


40 


18 


28 


11 


22 


28 


18 


28 


11 


11 


22 


68 


i v 


12 


13 


22 


38 


IO 


11 

X X 


16 


21 


38 


16 


16 


11 


27 


42 


14 


5 


14 


18 


34 


18 


24 


12 


33 


46 


15 


i 

9 


15 


20 


30 


17 


20 


10 


31 



Tumour grade was unknown in six cases. Hormone-receptor status was^ unknown in ten cases. 
Tumour size was unknown in one case. DNA mediation of RASSF1 A Tor ^one case was 
missing.Chi 2 Pearson: Tumour size - ESR1 (P = 0.005); Tumour size - ] RASSF1A (P ' - 
0.049)fprogesterone-receptor - APC (P = 0,036); Median age - RASSF 1 A meth ylated (79.0 
yrs • 49.6 to 86.2), RASSF1A unmethylated (59.4 yrs.; 28.2 to 82.3.) P = 0,009 



Table 6. Results of univariate analysis. 



Variable. 


No. of Patients 
Who Died/Total 
No. 


Relative Risk of Death 
(95% CI) 


P 

Value 

0.018 


Size of tumour 
Tl 

T2 . . 


10/64 
' '. • 5/17 


2.2(0.6-7.8) 



T3+T4 
Histologic type 
Invasive ductal 
Invasive lobular 
Others 
Tumor grade 
1 
2 
3 

Lymph node metastases 

No 

Yes 

Unknown 
Menopausal status 
Premenopausal 
Postmenopausal 
Estrogen-receptor 
Positive 
Negative 

Progensterone-receptor 
Positive 
Negative 

Hormone-receptor 
status 
Positive 
Negative 

ESR1 methylation 

Unmethylated 

Methylated 
APC methylation 

Unmethylated 

Methylated 

RASSF1 A methylation 

Unmethylated 

Methylated 
RASSF1/APC meth- 
ylation 

Unmethylated 

Methylated 



2/4 

13/55 
1/10 
3/21 

6/38 
9/32 
2/10 

7/58 
5/10 
5/18 

1/18 
16/68 

10/38 
6/38 

9/42 
7/34 



10/46 
6/30 

13/72 
4/14 

12/76 
5/10 

11/74 
6/11 



7/66 
10/19 



5.4 (0.7 - 42.9) 



0.4.(0-3.1) 
0.5 (0.1-2.1) 



2.1 (0.7-6.7) 
1.3(0.2-7.9) 



7.3 (1.7-31.7) 
2.8 (0.8 - 10.3) 



5.2 (0.6 -42.4) 
1.9(6.6-5.9) 

1.1 (0.3-3.2) 
1.1 (0.4-3.5) 

1.8(0.5-6.7) 

5.3 (1.3-21.3) 
6.9 (1.8-26.5) 

9,5(2.9-31.4) 



0.296 

0.310 

0.005 

0.062 
0.369 
0.766 
0.799 

0.370 
0.001 
0.001 
O.001 



Table 7. Multivariate Analysis. 



Variable 


Relative Risk of Death 
(95% CI) 


P 

Value 


Size of tumour 

T2 (vs. Tl) 

T3 + T4 (vs. Tl) 
Lymph node metastases 

Yes (vs. no lymph node metastases) 


2.7 (0.8-9.3) 
2.9(0.4-20.5) 

3.9(1:1-13.9) 


0.19 
6.039 
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Unknown ( vs. no lymph node metastases) 


D.Z \ \.2.-L2.. es t) 


0.06. 


Age 


1.0(1.0-1.1) 


RASSF1 A and/or APC methylated (vs. unmethyl- 


5.7(1.9-16.9) 


0.002 


ated) 
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Claims 



1. A method for determining the prognosis of a subject with a cell proliferative disorder 
of the breast tissues, said method comprising analysing the methylation pattern of a target 
nucleic acid comprising one or a combination of the genes taken from the group consisting of 
ESR1, APC, HSD174B4, HIC1 and RASSF1A and/or their regulatory regions by contacting 
at lent one of said target nucleic acids in a biological sample obtained from said subject with 
at least one reagent, or series of reagents that . distinguishes between methylated and non- 
methylated CpG dinucleotides. 

2. : A method for selecting a treatment of a cell proliferative disorder of the breast tissues, 
said method comprising: 

a ) determining the prognosis of a subject according to Claim 1, and 

b ) selecting a suitable treatment according to said prognosis. 

3. ' A method for determining the phenotype of a subject with a breast cell proliferative ; 
disorder comprising 

a) obtaining a biological sample containing genomic DNA from said subject, 

b) analysing the methylation partem of one or more target nucleic acids comprising one or a 
combination of the genes taken from the group consisting of ESR1, APC, HSD174B4, HIC1 
and RASSF1 A and/or their regulatory regions by contacting at least one of said target nucleic 
acids in the biological sample obtained from said subject with at least one reagent, or series of 
reagents that distinguishes between methylated and non-methylated CpG dinucleotides, and 

c) detenmning the phenotype of the individual by comparison to two known phenotypes, a 
first phenotype characterised by hypermethylation of the target nucleic acid and poor progno- 
sis as relative to a second phenotype characterised by hypomethylation of the analysed target 
nucleic acid and positive prognosis. 

4. A method according to claims 1 to 3 wherein said prognosis is the life expectancy. of 
said subject. . 

5. A method according to any one of claims 1 to 4 wherein said target nucleic acid com- 
prises the gene APC and/or its regulatory regions. 



6. • A method according to any one of claims 1 to 4 wherein said target nucleic acid com- 
prises the gene RASSF1 A and/or its regulatory regions. 

7. A method according to any one of claims 1 to 4 wherein said target nucleic acids 
comprise the genes APC and RASSF1 A and/or their regulatory regions. 

8. A method according to any one of claims 1 to 7, wherein said target nucleic acid or 
acids comprise essentially one or more sequences from the group consisting of SEQ ID NOs: 
1 to 5 and sequences complementary thereto. 

9. A method according to claim 5 wherein the sequence of said target nucleic acid com- 
prises essentially of SEQ ID NO: 3. 

10. A method according to claim 6 wherein the sequence of said target nucleic acid com- 
prises essentially of SEQ ID NO: 5. 

11. A method according to Claim 7, wherein said target nucleic acid or acids comprise 
essentially of SEQ ID NOs: 3 and 5. 

12. A method according to Claims 1 to 1 1, wherein. said cell proliferative disorder of the 
breast tissue is selected from the group consisting of ductal carcinoma in situ, lobular carci- 
noma, colloid carcinoma, tubular carcinoma, medullary carcinoma, metaplastic carcinoma, 
intraductal carcinoma in situ, lobular carcinoma in situ and papillary carcinoma in situ. 

13. A nucleic acid molecule consisting essentially of a sequence at least 1 8 bases in length 
according to one of the sequences taken from the group consisting of SEQ ID NOs: 6 to 25 
and sequences complementary thereto. 

14. An oligomer, in particular an oligonucleotide or peptide nucleic acid (PNA)-oligomer, 
said oligomer consisting essentially of at least one base sequence having a length of at least 
10 nucleotides which hybridises to or is identical to one of the nucleic acid sequences ac- 
cording to SEQ ID NOs: 6 to 25. 



15. The oligomer as recited in any one of Claims 13 or 14 , wherein the base sequence 
includes at least one CpG dinucleotide. 

16. A set of oligomers, comprising at least two oligomers according to any of claims 13 or 
14. 

17. A set of oligonucleotides as recited in Claim 16, characterised in that at least one oli- 
gonucleotide is bound to a solid phase. 

18. A set of at least two oligonucleotides as recited in any of claims 14 or 15, which is 
used as primer oligonucleotides for the amplification of nucleic acid sequences comprising 
one of SEQ ID NOs: 6,to 25 and sequences complementary. thereto. 

19. Use of a set of oligonucleotides comprising at least two of the oligomers according to 
any of claims 16 to 18 for detecting the cytosine methylation state and/or single nucleotide 
polymorphisms (SNPs) within the sequences taken from the group SEQ ID NOs: 1 to 5 and 
sequences complementary thereto. 

20. A method for manufacturing an arrangement of different oligomers (array) fixed to a 
carrier material for predicting the responsiveness of a subject with a cell proliferative disorder 
of the breast tissues by analysis of the methylation state of any of the CpG dinucleotides of 
the group SEQ ID NOs 1 to 5 wherein at least one oligomer according to any of the claims 14 
or 1 5 is coupled to a solid phase. 

21. An arrangement of different oligomers (array) obtainable according to claim 20. 

22. An array of different oUgonucleotide- and/or PNA-oligomer sequences as recited in 
Claim 21, characterised in that said oligonucleotides are arranged on a plane solid phase in the 
form of a rectangular or hexagonal lattice. .. - 

23. The array as recited in any of the Claims 21 or 22, characterised in that the solid phase 
surface is composed of silicon, glass, polystyrene, aluminium, steel, iron, copper, nickel, sil- 

; ver, or gold. 



24. A DNA- and/or PNA-array for predicting breast cell proliferative disorders' response 
by analysis of the methylation state of any of the CpG dinucleotides of the group SEQ ID 
NOs: 1 to 5 comprising at least one nucleic acid according to any of the claims 14 to 18. 

25. A method according to any one of Claims 1 to 4 comprising the following steps: 

a) obtaining a biological sample containing genomic DNA, 

b) extracting the genomic DNA, 

c) converting cytosine bases in the genomic DNA sample which are unmethylated at the 5- 
position, to uracil or another base which is dissimilar to cytosine in terms of base pairing be- 
haviour, 

d) amplifying at least one fragment of the pretreated genomic DNA, wherein said fragments 
comprise one or more sequences selected from, the group consisting of SEQ ID NOs: 6 to 25 
and sequences complementary thereto, and 

e) determining the methylation status of one or more genomic CpG dinucleotides by analysis 
•of the amplificate nucleic acids. 

26. The method as recited in Claim 25, characterised in that Step e) is carried out by 
means of hybridisation of at least one oligonucleotide according to Claims 14 or 15. 

27. the method as recited in Claim 25, characterised in that Step e) is carried out by 
means of hybridisation of at least one oligonucleotide according to Claims 14 or 15 and ex- 
tension of said hybridised oligonucleotide(s) by at least one nucleotide base. 

28. The method as recited in Claim 25, characterised in that Step e) is carried out by 
means of sequencing. 

29. The method as recited in Claim 25, characterised in that Step d) is carried out using 
methylation specific primers. 

30. The method as recited in Claim 25, further comprising in step d) the use of at least one 
nucleic acid molecule or peptide nucleic acid molecule comprising in each case a contiguous 
sequence at least 9 nucleotides in length that is complementary to, or hybridises under moder- 
ately stringent or stringent conditions to a sequence selected from the group consisting of 
SEQ ID NOs: 6 to 25, and complements thereof, wherein, said nucleic acid molecule or pep- 



tide nucleic acid molecule suppresses amplification of the nucleic acid to which it is hybrid- 
ised. 

31. The method as recited in Claim 25, characterised in that Step e) is carried out by 
means of a combination of at least two of the methods described in Claims 26 to 30. 

32. The method as recited in Claim 25, characterised in that the treatment is carried out by' 
means of a solution of a bisulfite, hydrogen sulfite or disulfite. 

33. A method according to any one of Claims 1 to 12 comprising the following steps: 

a) obtaining a biological sample containing genomic DNA, 

b) extracting the genomic DNA, 

c) digesting the genomic DNA comprising one or more of the sequences from 
the group consisting of SEQ ID NOs: 1 to 5 and sequences complementary thereto with one 
or more methylation sensitive restriction enzymes, and 

d) determining of the DNA fragments generated in the digest of step c). 

34. A method according to claim 33, wherein the DNA digest is amplified prior to step d). 

35. The method as recited in any one of Claims 25 to 32 and 34, characterised in that more 
than six different fragments having a length of 100 - 200 base pairs are amplified. 

36. the method as recited in any one of Claims 25 to 32, 34 and 35, characterised in that 
the amplification of several DNA segments is carried out in one reaction vessel. 

37. The method as recited in any one of Claims 25 to 32 and 34 to 36, characterised in that 
the polymerase is a heat-resistant DNA polymerase. : 

.38. The method as recited in any one of Claims 25 to 32 and 34 to 37, characterised in that 
the amplification is carried out by means of the polymerase chain reaction (PGR). 

. 39. The method as recited in any one of Claims 25 to 32 and 34 to 38, characterised in that 
the amplificates carry detectable labels. 



40. - The method according to Claim 39, wherein said labels are fluorescence labels, radio- 
nuclides and/or detachable molecule fragments having a typical mass which can be detected 
in a mass spectrometer. 

41. The method as recited in any one of Claims 25 to 32 and 34 to 40, characterised in that 
the amplificates or fragments of the amplificates are detected in the mass spectrometer. 

42. The method as recited in any one of the Claims 40 or 41, characterised in that the pro- 
duced fragments have a single positive or negative net charge for better detectability in the 
mass spectrometer. 

43. The method as recited in any one of Claims 40 to 42, characterised in that detection is 
carried out and visualised by means of matrix assisted laser desorption/ionisation mass spec- 
trometry (MALDI) or using electron spray mass spectrometry (ESI). 

44. The method as recited in any one of the Claim 1 to 25 or one of the Claims 25 to 43, 
characterised in that the genomic DNA is obtained from cells or cellular components which 
contain DNA, sources of DNA comprising, for example, cell lines, histological slides, biop- 
sies, tissue embedded in paraffin, breast tissues, blood, plasma, lymphatic fluid, lymphatic 
tissue, duct cells, ductal lavage fluid, nipple aspiration fluid and combinations thereof. 

45. A kit comprising a . bisulfite (= disulfite, hydrogen sulfite) reagent as well as oligonu- 
cleotides and/or PNA-oligomers according to any one of the Claims 14 to 15. 

46. A kit according to claim 45, further comprising standard reagents for performing a 
methylation assay from the group consisting of MS-SNuPE, MSP, Methyl light, Heavy 
Methyl, nucleic acid sequencing and combinations thereof. 

47. The use of a method according to any one of claims 1 to 12 and 25 to 44, a nucleic 
acid according to claim 13, of an oligonucleotide or PNA-oligomer or a set thereof according 
to any one of claims 14 to 18, of a kit according to Claim 45 or 46, of an array according to 
any one of claims 21 to 24 or of a method of manufacturing an array according to. claim 20 in 
the prognosis, diagnosis, treatment, characterisation, classification and/or differentiation of 
breast cell proliferative disorders. - 
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Abstract 

The present invention relates to modified and genomic sequences, to oligonucleotides and/or 
PNA-oligomers for detecting the cytosine methylation state of genomic DNA, as well as to a 
method for the prognosis and treatment of a cell proliferative disorder of the breast tissues. 
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Sequence listing 

<110> Epigenomics AG 

<!20> Prognostic and diagnostic markers for breast cell proliferative 
disorders . 

<160> 25 



<210> 1 

<211> 3501' 

<212> DNA 

<213> Homo Sapiens 



<400> 1 



gcggggctgg. 
atcggctaag 
acgggccgcg 
aggggagctc 
ctgcgagccg 
tttccccaag 
gaggcgccgg 
gcccgggtca 
tcccctggga 
ccccgatata 
ccgctgcctc 
cggccagggc 
tggggcaact 
ggtaaagttc 
actgtctcca 
gccaggtgcc 
tcctgctcct 
gtccttcgcc 
gtggagggga 
ggccggtgtg 
cgcccggcca 
tgtgcgacgt 
cggccagcag 
atgacatggt 
tggctgacgg 
tgggcgccgt 
agaaacgcct 
gcggctacgc 
aggcctgcta 
gcccagaggc 
ccgccgcact 
agaagagccc 
cgcgcccgga 
tgccgtcgct 
catttcgcgg 
gtctcctcta 
gccgggagcg" 
cgcccggggg 
ggcccggcgc 
gcgaggaccc 
atggcgagcc 
tccccagctc 
tgtacggcag 
; ttggaggcgg 
cctaccgctg 
agaagacgca 
agcgtgggac 



caggggcgct 
agctgccacc 
gggccccgcg 
agggctcggc 
agggcctggg 
tttggggcgg 
gcccgcgcgt 
gggccgcagc 
gctgcgtggc 
acgccctccc 
cgctccccgc 
ggcgccaggg 
tctcccgagg 
tccgccctga 
aaagggtcac 
ctggggcgtg 
tctcctggtc 
ggtgcccagg 
gaagtgccgg 
tgtccccgca 
ctccaggcag 
gatcatcgtg 
cgcctacctc 
gagcccggcc 
cgcagaggcg 
gctggccgcc 
caagcgccac 
gccctatggt 
cccgtcccca 
cgcSgtcaac 
ctgtgcctcg 
gccgggctcc 
cagccctccc 
gccgccgctg 
cggcagcggc 
tcgctggatg 
cggctccccc 
gcccccgctc 
gggcggcgac 
cagcccgcct 
cgagagcttc 
tgagcagctg 
ggccgaggcg 
cggggacaag 
cgcgtcgtgc 
ctggctgacc 
catgacgcgc 



gccctggcac 
gccgcgggga 
ggccaggagg 
tccgggctct 
gccggcgcac 
cggagttccg 
gtagggccca 
cggctgtgcg 
tcccccctcc 
cgcgccgggc 
gcggccgccg 
cgggcaccgc 
cgggaggcgc 
atgacttttc 
tgcgcctgaa 
caggccgccc 
cgggcgggcc 
ccgcagggct 
gctggggcca 
ggagagtgtg 
ctgctgctgc 
gtgcagaacg 
aagtccctgg 
gtgttccgcc 
gctgcggccg 
gccagctacc 
ggcaagtact 
cggccgggcc 
gtcgggcctc 
acgcactgcg 
gagcgccgct 
gcggcgccag 
agcgccggcc 
cccttccaga 
agcccgggac 
aagcacgagc 
agcgagcgct 
ggcctggcgc 
ggcgacgact 
ggcggccacc 
ggtgacaacc 
aacgcgcacg 
gccgaagtgg 
gtcgccgggg 
gacaagagct 
cggccctacc 
cacatgcgca 



agctcggggc 
ggggagcccg 
ggaacggggt 
gccgccggat 
tcctcccgcc 
ggggagaagg 
ggccgaggcc 
ccgtgcccgc 
cccccacctg 
ccggccttcg 
cccgggcccc 
gctcccctcc 
tggttcctcg 
ctgaagcgga 
cagttttctt 
tggcctcccc 
ggcctgggct 
gatgcccccg 
ggcggccagg 
ctgggcagac 
agctcaacaa 
ccctcttccg 
tggtgcatga 
tggtgctgga 
cggccgtggc 
tgcagatccc 
gccacctgcg 
ggggcctgcg 
cgccgccgcc 
ccgagctgta 
gctcccctct 
agcggccgc^: 
ccgccgccta 
agctggagga 
ccgagccccc 
cgggcctggg 
gcgaagagcg 
cgccgccgcg 
acaagagcag 
tcgagggcta 
tgtacgtgtg 
tggaggctca 
ccgctggggc 
ctccgggtgg 
acaaggaccc 
catgcaccat 
gccacctggg 



ctggcagcgg 
gcccgccggg 
cgggcgggcg 
ttgggggccg 
ctgtctgcag 
ggccggggga 
gggacgcggg 
ccggggcgct 
cttcctgcct 
cgctctgccc 
gaccgagggt 
tccgtatcac 
gctccctttc 
cattttactt 
ctcggaagcc 
tccaccggcg 
cccactccag 
ctcagctgag 
gcgccgcacg 
g.atgctggac 
ccagcgcacc 
.cgcgcacaag 
caacctgctc 
cttcatctac 
cccgggggct 
cgacctcgtg 
gggcggcggc 
ggccgccacg 
tgccgcggag 
cgcgtcggga 
ttgtggcctg 
ggctgagcgc 
caaggagccg 
ggccgcaccg 
cggcbgcccc 
tagctatggc 
tggtggggac 
ctaccctggc 
cagcgaggag 
cccatgcccg 
cattccgtgc 
cgtggaggag 
cgccggccta 
cctgggagag 
ggccacgctg 
ctgcgggaag 
cctcaagccc 



cgggtggggc 
accgcaggta 
agcagcgggc 
cgaggaagag 
ttggaaaact 
gccgcggagg 
tggggcgcag 
gccccctccc 
cagcctcctg 
gccacggcag. 
tgacagcccc 
ttcccccaac 
tccctacttg 
aaatcgggta 
ccagcaccca 
gccgctcacc 
agggcagccg 
ggaaggggaa 
gctctcacce 
acgatggagg 
aagggcttct 
aacgtgctgg 
aacctggacc 
accggccgcc 
gagccgagcc 
gcgctgtgca 
ggcggcggcg 
ccggtcatcc 
ccgccctcgg 
cccggcccgg 
gacctgtcca 
gagctgcccc 
cctctcgccc 
ccttccgacc 
gacgggccta 
gacgagctgg 
gcggccgtct ■ 
agcctggacg 
accggtagca 
cacctggcct 
ggcaagggct 
gaggaagcgc 
gggccccctt 
ctgctgcggc 
cggcagcacg 
aagttcacgc 
ttcgcgtgcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
'660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 . 
1860 
1920 
1980 
2040 
.2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
" 27 60 
2820 



acgcgtgcgg 
cgggcgagaa 
tcatcagcca 
cgggcttggg 
ccgagggcgt 
acaaggcggc 
tggcgctgga 
acaaggcggc 
ccatcgaccg 
gcggccctgg 
ggacaaccgc 
ctgcttcgtg 



catgcggttc 

gccctacgag 

catgaagatg 

ggggctcccc 

ctttgctgtg 

cgcggccgag 

gagcctctac 

cgaggtgctg. 

tttctctccc 

cccgcacccc 

agcgtcgcca 

ccttagctcg 



acgcgccagt 
tgccaggtgt 
cacgccgtgg 
ggcgtccccg 
gctcgcctca 
ctgctggcgc 
ccgctggcca 
agccagggcg 
acctagagcg 
agggagcggc 
cagtggcggc 

g 



accgcctcac 
gcggcggcaa 
ggggcgcggc 
gccccgacgg 
cggccgagca 
agaccacgca 
agttcacggc 
ctcacctggc 
cccctcgcca 
gggggcggcg 
tccacctctc 



» • « 



ggagcacatg 
gttcgcacag 
cggcgcggcc 
"caa'gggcaag 
gctgagcctg 
cttcctgcac 
cgagctgggc 
ggccgggccc 
gcccgctctg 
cgcagggccc 
ggcggcctca 



• • • 



cgcatccact 
caacgcaacc 
ggggcgctgg 
ctcgacttcc 
aagcagcagg 
gaccccaagg 
ctcagccccg 
gacggccgga 
tcgctgctgc 
actgtgcccg 
cctggcctca 



<210> 2 

<211> 2501 . 

<212> DNA- 

<213> Homo Sapiens 



<400> 2 



tttccatagt 
cagcacaaaa 
tctttctccc 
gggtcgccac 
ggatagggtc 
ggctgcaccg 
actcaggaga 
agatcattta 
aagcagtggg 
gcagggatta 
gctatctctg 
aaacccagaa 
• gtgtcgagtt 
aaatggcccc 
ggattccaag 
gaccctcgtc 
gctctgcttg 
cgacgggcgg 
ccgcgcccct 
gcagctgagg 
atctcttgag 
tccccagcac 
ctgctgattg 
.tactcttcct 
cgcgaggggc 
tcattcggtc 
cttctttaaa 
gtcacaatct 
acaggggtca 
ctggagagca 
gcggcaaaat 
aacaaaccat 
agtgggtgac 
ccactggtag 
taaacgttac 
taaagtggct 
attttatacc 
ccttctaaat 
attttacaac 
gaggtccagg 
ggggctatat 
- cttacacatg 



gtaaatgtgt 
ttcaaacatc 
caaacctagc 
tttaggtcct 
tgtttatttt 
ttattgaaag 
tgctcagtaa 
ggaacacttg 
ggtccaggac 
ttacactgac 
gaagcccctc 
aacgtttatt 
aactattctt . 
tggaaggcaa 
ggagccgcgc 
ccgcccccgc 
ttcgtgtgtg 
gtggtactgg 
tgctgaggcg 
tcaccccgct 
gaggaaagag 
cccggtgtgg 
caaaactggg 
gcaattaact 
gggggctgga 
gttcagtgtt 
gagggcgtct 
gttctcgaaa 
ggcgtatgat 
gagaattccg 
ggaaaattgt 
tcataccaac 
ctctgtaacc 
taccttatac 
aaactggaat 
tcttatgctg 
catagctgtc 
gtttttagga 
gcttggtgtt 
gattacattc 
ctgaaaatca 
gtatgcagta 



tcccaccact 
taaacatgat 
tagtgtttgt 
ggtcctcctt 
gacaccatta 
tttaccatct 
atgttgattg 
tcaaaatgtt 
tctgcatttt 
aaatctttac 
aaggataggg 
gagtgaatga 
ctacacaagt 
gtcttccagc 
gctctgcgct 
cattccccgc 
tgtcgttgca 
tcaccggcgc 
cagctggctg 
gaggtggtgg 
ccggaaacac 
gcttcccaag 
tgaaagttct 
ctcttagatc 
attcgccccc 
tgtttccctc 
gataagagcc 
tccactggag 
ggagaaagaa 
aagtggaccg 
tatgtgaaat 
atagatgctg 
tccaagtgca 
ctggaaaggc 
tctgtcaatt 
cctttcagaa 
tggttatctc 
aaacattttg 
ttaaatcttt 
taagacgttt 
ttcaaggata 
cattttacac 



ctctggagta 
cttgctggct 
gctgcctgta 
cagaaagttt 
gctcacttac 
gactctgagt 
aactatgact 
tttgcccctt 
gacagtcccc 
catccctaag 
gccgcatgct 
tgaaacgaca 
cctagcagct 
gagtggagtc 
catcttccta 
ctcctcctgt. 
ggccttattc 
gggggcaggt 
ctcttttcgg 
ggaggggaat 
ctggtctctc 
gtcctgcctg 
ccctgaccct 
tttgcctagt 
tgaagcgcag 
tgtcgtaaga 
cttccccgtt 
tcttggcaga 
aatgggagta 
gccagtggga 
agccccattt 
tgcagtgaga 
gggatcttga 
cctaatgcat 
aattcctatg 
aatgctttca 
tgcatgggca- 
tttacacttt 
tttgacagct 
ttgccatcgc 
gggactgctt 
cagtactcag 



atcctactta 
ttgcttttgt. 
atgcccttct 
ttcctctttc 
acacattggt 
agcttgagga 
.gttctcaaca 
gagattctat 
tgatgtttgc 
ggggactttc 
gtttctctag 
ggtgaataga 
cccattgctt 
actcttaact- 
ccagaaatcg 
cccgcagtcg 
atgggctcac 
gagcatgcga 
gccggcatac 
ggttattctt 
aagcaggtac 
agaggagagg 
tatctgtggg 
cttttaaagg 
agatgtcagc 
ttttaagttc 
ggagtttgta 
ggttgtaagc 
ggatgggcac 
aagttgcctg 
tttaaagtac 
ttttacatta 
cattatgcac 
gaattatttg 
tactttcata 
gtgttgatga 
tgtatttggg 
gcttttattg 
cttggataat 
taaggagact 
cttttgacac. 



2880 
2940 
3000 
3060 
3120- 
3180 
3240 
3300 
3360 
3420 
3480 
3501 



aaaccgtttt 
ggctttaccc 
ttctttgcag 
tccccagcgg 
cacaagtcta" 
tcctatcaaa 
tacaaacgca 
tttgggaggt . 
atgtagaagt 
cttcccaggg 
gtcagcaact 
tgaacgcaag 
ccagccgcag 
acatttccca 
gcaagtcact 
gcgtccagcg 
cgctgaggtt 
aggttggagg 
gcgcgcagcc 
gaggcaccgc 
agcccgcttc - 
ccaggctggg 
catcgattgt 
actgaaaagc 
tcctgaaaag 
gtgagaggac 
tgcttagcaa 
tcaaatgcgc 
atctgaggaa 
tatttcagga 
aaaaaattaa 
gtttctcacc 
ctttgattct 
agttatatat 
tctgtattga 
atagccaagt 
tgtagtcata 
taaataatgt 
tttcatgcag 
ttccttttca 
cattagcata 



60 ■ 
120 
180 
240 
300 
360 
'420 . 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
• 2220 
2280 
2340 
2400 
2460 
2501 
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<210> 3 

<211> 2470 

<212> DNA 

<213> Homo Sapiens 



<400> 3 

aaagatgatt 
agggtacttt 
■ ggttataact 
ttttctctta 
gagaaatttt 
t'tataatgtg 
tcactcctgt 
accccttgcc 
aagaggttat 
ttttagcata 
ttgtaagtgt 
atgaccaggc 
gtgatccctg 
actgtagatt 
tggtagagtc 
gtctgcctta 
gtttaagtat 
aggcaggatt 
cctgaatctt 
tcctcttacc 
taactgccag 
tctttaggta 
agcataggct 
cttatttacc 
tccattttgt 
tggaggatgt 
tcattaatcc 
agtattacta 
aatgggtaaa 
tctgtgagaa 
tttatcttga 
tgttttgaga 
gtttccttta 
aaaatccttt 
tcagttgtta 
agaagataat 
aagactgtga 
tgaggtaaga 
tcttgcaaaa 
atataaagaa 
taaagagaag 
tagaaactag 



aaaagtttaa 
tagcagtctc 
tctgtgtctc 
ttttgtttta 
aaggttcaca 
ctgggtattt 
ggatttttcc 
cattgatcag 
gttacatgta 
ccttttttta 
tcatttacaa 
ctgaaattaa 
cctgcaggac 
ctagccgttg 
ttccaattgc 
ttttgtgtaa 
tgcttaagtg 
ttatttgttc 
acttggttaa 
tggctatctt 
actctgcttc 
tcttttggcc 
attatagtca 
atcttcctat 
taaatgattg 
gtaagattag 
attctggctc 
atataaataa 
atatctttaa 
ggcaaatgta 
actaagactc 
ccaaaaacta 
cccctttctt 
ttaaccttat 
aagcaagttg 
tccaatcatc 
cttttaattg 
cactttactt 
gttagcattt 
aataagtgta 
agaccaaata 



ttgttcatct 
tgctcatctt 
agtaaacttg 
ttattttgat 
ttattcaaat 
tcagacctgc 
tttatttttg 
a'ggaggggtc 
tattttcata 
ttaatcctaa 
attgtgaaat 
cacacaagac 
tccccttcct 
tagaactaac 
tgaactccaa 
tggatttcat 
caaactactt 
caaaatgatt 
tcttaatata 
ttgtatcttc 
atttctcttt 
tcagtttgag 
accttttgaa 
caaaaaggct 
ccttttaatg 
ccttttatca 
taaaattctg 
atttttatca 
aaccttgaag 
ttcagacaca 
cactgtttca 
gcatattaac 
taaaaacaag 
aggtccaagg 
aggcactgaa 
ttacaaaact 
tagtttatcc 
aaaagtgtat 
atatttttaa 
aaactccaat 
tagcatcact 



gaagagttga 
gcccatccgg 
tgcatgccca 
gtaaaattac 
tctgtcagat 
ttattaaaaa 
agatgtctcc 
ttaactatgg 
atataactta 
•taatatcact 
gaattaaaat 
tcaatttttt 
cctcagatgt 
tagatctaag 
tattgtcgtg 
attacagagg 
^aatacttttt 
tgacctaaac 
actcctagca 
tttgtctctc 
gacagtctct 
cacagcagat 
taaatgtgat 
taaagtcttc 
acatcttaga 
aataaaaagt 
tgactatcag 
taattatatt 
aaattattac 
actaaagttc 
tcctcttaga 
acaattcttc 
cagccactaa 
gtagccaagg 
gatggagaac 
ggaaactgag 
atttttattc 
tttaaattaa 
ataagatata 
ggctagttag 
gtactgagg'6 



tttttttatt 

ctctttttgt 

tttttttctc 

ctgttaattt 

ccctacctct 

gatgtaaaac 

tttggctgca 

gtgaacccta 

catttacata 

gtaagttatg 

gaaagggcaa 

tcaaccaaag 

cattggattg 

atgagtcccc 

actagccagt 

cattttttta 

agctattaag 

taaaaagaga 

ttctataatt 

ctcttctttc 

actcctaagg 

cccaagacca 

tgaactttat 

atttaatgct 

acttcagaac 

gtgaaatgga 

ataaaattca 

tcctaagttt 

ttgatagaaa 

tctcttctat 

tgctgctact. 

ttaaacgtct 

attttttagt 

atggctgcag 

tcaaatcttc. 

gcatctaata 

agtattccct 

gcaataatat 

ttgaattcat 

ttcttagttc 

aaggttttct 



cctgtaataa 
ggttgtgtaa 
tgttactacc 
tatttgaaat 
gtcatatggt 
aaaataatga 
ttacttcttc 
tatcttactg 
gtacttttat 
ttgaagcaga 
agattaaatc, 
acttttgtag 
taccaggttt 
tgatttcctt 
gttacaacct 
atgtcaagat 
taattaagat 
atgtggatct 
cttcctaaag 
ccagtcataa 
tcatccattc 
catatgccat 
gttagtaatt 
ctccttcatg 
tatttcacca 
atatgtaatc 
gaaataaaat 
tgcctgtaag 
gtttaatcca 
tttaatttca 
tgaacaatat 
taagagtttt 
agtgaatttc 
cttcatatga 
gacaagagct 
tgaaggtatc 
cttgtaaact 
gtaaactctt 
tcagtgaatc 
tttttaagat 
gtgtagttca 



<210> 4 

<211> 7001 

<212> DNA 

<213> Homo- Sapiens 



<400>- 4 

aatgcaatgg 
gtgtggaagg 
ctggagagac 
gaatggtgac 
•ctggagacac 
cagtaagtca 



aaaaagagag 
caagggaaaa 
agtgatgctg 
tcatatttga 
attcaacgga 
cggctggaac 



attgtaaagc 
tcagccctcg 
ggcacagaca 
acaagcctgc 
ggagccagat 
gtgtgtgtcc 



tagaaggctt 
aagaagacag 
cggggaagtt 
aatgpccagc 
caatctttac 
agcaggagag 



aggaattgcc 
tgagatttta 
gagaggaaca 
agaccgctgg 
ccttcttcac 
ggtagggagg 



tcttgattag 
atctgggtgg 
ccatgtttga 
aaaagtgggg 
ctgagagagc 
gaagccaaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800- 
1860 
1920 
1980 
2040 
• 2100 
2160 
2220 
2280 
2340 
2400 
2460 
2470 



60 
120 
180 
240- 
300 
360 



_ 4<- • • 





gagctgggag 
gtatactttc 
aagagaatgc 
ctttgattgt 
ttcctatgtc 
tcctctctcc 
tggccattgt 
ttcaatattt 
gtgctcaaac 
ggcatcatta 
cataaatctc 
gtgattagag 
gggtgtcatg 
gttagtacta 
gccactggga 
atcagtttaa 
tacgtatttt 
gggtttggtt 
ccaaaacatt 
gttcagagac 
tttattgtta 
atccatacac 
tgtttttcct 
gtaggggcaa 
ctgctccctg 
tgtgcagtag 
tgcttcccga 
tcacacactg 
cccttgactt 
aggcttgttt 
tggctagtgt 
tgtgtctcct 
■ caggttcaca. 
ttgaaaattg 
gtaatgattg 
ttatttgctg 
gcatattgtc 
aaatagtcaa 
acttcgtcag 
gtatcagagt 
attatagacg 
tatatatata 
aattcagtta 
tagcccaagt 
attttaaaaa 
ttcgtgacct 
gttttggttg 
gtttttctga 
ccaacataaa 
actttgggag 
ggcaaaatgc. 
ccccctgtca 
cgcacacccc 
tatgtacaca 
aacagagata 
cccctggatc 
gatcctcgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgca 
gctggcggag 
cggacccgca 
gctgcgtcgc 



cccgagtgaa 
ccacccatgc 
tggagagaaa 
aagtctctga 
aatagtgaac 
tccctccctg 
tgacctacag 
atttatatcc 
actgcaatac 
acatttttat 
tcagtaactg 
tcatttaata 
tgtcctggga 
tggccgagag 
aatgagagac 
aacaaaggta 
tcacatgttt 
aacgaagtgg 
tattccaaag 
tgggggctag 
aattatattc 
ttttgactgg- 
cttgcttggg 
aggggctggg 
tgagcagaca 
cccagctgcg 
atccctgcca 
agccactcgc 
ctacaagccc 
tgatttctct 
ctgtatccta 
ttttctttca 
ttttttaaga 
aaagtgtgat 
ctggccataa 
acaatctata 
caacaatggt 
aaatgtaact 
atactctgtt 
ctattaggct 
catgatatac 
tatatacaca- 
aaaacttttt 
gaaccgagaa 
tctgttagct 
gaggttatgt 
cataaccaac 
atcatccttc 
cacacaagtc 
cattttgcag 
agctcttcct 
gttcccccag 
attctatctg 
ctacgtattt 
tatcggagtc 
cgtctttcgc 
cgcccccgcc 
aagttcaggg 
taagccaatg 
tatgagctcg 
ggcgttcgtc 
ggctcccggg 
ctctaacctc 



gtttttgcca 
tcaccaagcc 
gtggttaaga 
acaatcaaag 
aattgcagat 
aatactcttg 
gaggcaccac 
agtatttatt 
tgggggtctt 
ggctttctat 
catagtgtcc 
tccaaggtgg 
ccctgagcgt 
aatagttgag 
ctcgttccca 
tctaccttta 
acagaaagca 
aggagtatta 
tctggtttcc 
ggccagtaag 
tttcggaaaa 
cattctattt 
gtggcagcag 
gtttctcctc 
gcaagtctcc 
tgtctgccgg 
ttccacgcac 
acatgcgagc 
atggaacatt 
ctctgtagct 
gcagggagat 
ggacttgtag 
gccctatgaa 
ttatggaaat 
agggaaatat 
gtcattaatg 
ctatttttgt 
cttagtaaca 
gggtgtttgg 
gcagcaaagg 
ttcacctatt 
tatatgtgtg 
tctatttgtt 
gatcgagttg 
ggaccagacc 
ttggtatgaa 
ctgtggaagg 
acatgagaat 
aggctgagag 
aggaagaaac 
atatgtatac 
ctgctaaata 
ccctatctcg 
ctagccaacg 
tggcacgggg 
gtttatttta 
ccctggccgt 
aagctgctct 
tcagggcaag 
ggagaccagt 
ctgggactgc 
gcagggccgg 
•gggctgtgct 



aaggcagaag 
cagggacaag 
aaactgcctt 
cctgtgagga 
cccctttcct 
cttttttctg 
tgtcaccfaac 
ttcaatactg 
tcaaagcaca 
ttcttttttg 
cagggccaga 
tgactaatgt 
gggcactcta 
aaagtggtca 
atcacggtca 
tcttatgttc 
gtcaactgag 
catttcagct 
tggtgcaatc 
g'catttgatc 
accacccatg 
tagccgtaag 
aaagcatagg 
cccagtacag 
cctcactccc 
gaggggctgc 
aaacacatcc 
acattccttc 
tctggaaaga 
ttagcatttt 
gaggattgct 
gattctttgt. 
gtgctttttg 
taaatcatct 
ctgcgatgca 
ctaaactcga 
aagaattaga 
gtaagcttgg 
gatagcaatt 
aagtttattc 
.ttttgtctcc 
tgtgtatgtg 
tttcatctgg 
taggactaaa 
gacaatgtaa 
aaggtcacat 
catgaacacc 
tcctaatggg 
aatctcagaa 
tgaggtcctg 
cctgaatctc 
tagctgtctg 
gttacagtgt 
aggaggggga 
cacataaggc 
agcccagtct 
gaaactcagc. 
ttgggatcgc 
"gcaacagtcc 
acttaaagtt 
acttgctccc 
ggccagagct 
ctttttccag 



aggaaagtcg 
gctcaccaag 
tactgaactt 
gacagctaac 
ttccttctcc 
ggactggtct 
aaagggtaac 
actatggaga 
aaaacatata 
tactggtctc 
gaccggccac 
ctggcaacaa 
ggagcacctc 
agaggtggat 
gtgcaactcg 
atatcctagg 
ctattcatgg 
ggaaacacat 
ggaggcatgg 
cacatgtatc 
tcctattttg 
actatgattc 
gtactttcca 
ctttctctgg 
cactgccatt 
caagtgccct 
acacactctc 
cttccttctc 
cgttcttgat 
gagaaagcaa 
gttctccatg 
gccatttgca 
catgtgtttt 
gtaaaaaatt 
cctaatgtgt 
ttttggcttc 
taaaatgtat 
catttagata 
aaaacaaagt 
aaaagtataa 
ttaatatgta 
cgtgtgcatg 
atatttgatt 
ggatagacat 
cataattgcc 
tttatattca 
catgtgcgcc 
accaagtaca 
ggttgtggaa 
gcaggttgca 
cgcccccttc' 
tggctggctg 
agtcctcccc 
atcaaacaga 
agcacattag 
tccctgggcc 
ctctatccag 
tccaaatcga 
ctggccgtcc 
ggaggcccgg 
gtcgggtcgc 
cgcgtgtcgg 
gtggcccgcc 



gcgtagcaca 
atgagtttgg 
cttgggctaa 
cttcttattc 
tttcccctgt 
agagcatggg 
agtctttctt 
gagctctcct 
tttgcatgat 
aagagccact 
tcctggcatt 
agcctccatt 
agtattgcgt 
ccatgtgaac 
aaagcctaaa 
cttttaataa 
aaaggtttgt 
-ccctagaatg 
caatgcctct 
ccagaaggct 
taaacttgat . 
acagcaagcc 
gcctccaagg '* 
ctgtgccaca 
catccagcgc 
gcctactggc 
tctgcctagt 
actctctcgg 
ccagcagggt • 
cttacctttc 
ggggtatgtg* 
tataatttgg 
aaaaaggcat 
gctttggaaa 
tttt.aaccct 
agctacattt 
acttgatata 
gaccatgaac 
attgatagtt 
actatccaag 
tatatatata 
tttaactttt . 
ctgcatatcc 
gcagaaatgc 
aaagctttgg 
gttttctgaa 
ctaaccaaag 
gtactgtggt 
gggtctatct. 
ttctcctgat 
ccctcagatg 
cgtatgcaac 
agggtcatcc 
aagagagaca 
agaaagccgg 
acctttagca 
cagcgacgac 
gttgtgcctg 
tccagcacct 
gagcccagga 
ccggcttcac 
cgggacatgc 
ggtttctgag 



420 
480 
540 
600 
660 
720 
780 
840 
'900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
' 1740 
1800 
1860 
1920 
1980 
2040 
'2100 
2160 

2220 

2280* 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

* 2880 
2940 
3000 
3060 
3120 

' 3180 
3240 
3300 
3360 . 
3420 
3480 
- 3540 
3600 
3660 
3720 

• 3780 
3840 
3900 
3960 
4020 
4080 
4140 



ccttctgccc 
ccctccacac 
agcccctgaa 
tggacagcag 
ccgcggccgc 
ctgaggctgc 
ctccgagccc 
acggccagca 
ccggcccgcc 
gcccggcagg 
cgcgggacgc 
ggttctgcgt 
aaacgtactc 
ctaacaaaac 
tccaggcaaa 
aaagttgggg 
ttttgtttaa 
agctatgatg 
cccacgtcct 
ctggctgagt 
taggtgtccc 
ctgccagccc- 
t'cagccaggg 
. gagaccactt 
acaagaattg 
tttgtttttt 
gtgaacacct 
agggcgccac 
ctggtgtccc 
tttccaggga 
actgaagggc 
gagcatccca 
aacttttaaa 
agagaaaaag 
gattaagtct 
gatgtgaatt 
tcagttcttt 
cttggtagat 
gggatcgcat 
tgccctatct 
agagacaagg 
ttcagtttta 
agaattctgc 
gacagatatt 
aaaggaaagg 
agttttgggg 
tcagtttagt 
aaatatcaga 



tgcggggaca 
caaagcatct 
ccgtccgcag 
caagcccgcc 
cgccaacgcg 
ggcgttcggc 
gctgatgcta 
ggtgccctac 
ggcattctac 
agggagggag 
gcgacccgag 
gcagcccgcg 
•tccacccact 
acacgttgga 
taaacacggg 
gtgtttggag 
tgtgctcccc 
atgactttct 
ggggctttag 
ccgcgctgga 
gcgagtttaa 
agtgacaagc 
tctgcgcctc 
tgtgact'tca 
ttctttctcc 
gttttttgtt 
tcctaggtgg 
atggctgggt 
tatatctttc 
tatttatagc 
agattctggg 
cagccacgca 
ggttggcata 
gatttcttct 
aattccacac 
ccataacctt 
tattgcaaat 
tatttactgc" 
cttattcaaa 
tacactttct 
aacttaaaaa 
gaatgatttt 
ctgttattat 
atttagtatt 
attgattggg 
ctgaaagcag 
gagtctagaa 
gtgcatggta 



cggtctgcac 
gggatggccc 
ctcaagatcc 
gtgtacaact 
caggtctacg 
tccaacggcc 
ctgcacccgc 
tacctggaga 
aggtacccgc 
ggagggaggg 
ggtgcgcgca 
ctgcgttcag 
taccgtccgt 
aaaccagaca 
gcgctttgag 
ttagcagagc 
aaatttcctt 
gggaagcgat- 
agagcgattg 
gcggattgct 
aacaagccat 
ctgtcccacc 
gcagccactg 
atggcgaagg 
ctctcctctc 
ttcctgatgg 
ccacaggaca 
.cctgaattag 
tgccttgtaa 
agaaggaagg 
ggtattcagt 
gatccaagct 
tttcaaatta 
gcccaccttt 
.aacatttaaa 
atgatggact 
taatgctgct 
cttgctaata 
attgttcatc 
gcattacatg 
ttaaaaaaaa 
tcctgtgttt 
ccagcaagga 
t.ttcccctct 
ctcttttagg 
tgttttcaaa 
tattttttaa 
tttcgtaagg 



cctgcccgcg 
tactgcatca 
ccctggagcg 
accccgaggg 
gtcagaccgg 

tggggggttt 
cgccgcagct 
acgagcccag 
gcccgcgccg 
agaagggaga 
gggagcccgg 
agtcaagttc 
gcgagaggca 
aagcagcagt 
tcacttggga 
tcagcagagt 
tcatctagac 
tcctgtcacc 
ggagttgaat 
ggcatgtgac 
atggaagcac 
cggggagaat 
tggaaggagc 
ttgtgtgtcc 
cctcccattt 
gcctgcagag 
ggtgtacccc 
gcatttccca 
gccgtggacc 
ggactaaagt 
gtgcatcttc 
gtctttactc 
attttacttg 
ttttttttaa 
actgcttgat 
ccagaaacca 
gaatttcaat 
attacaaagt 
atcccagtga 
atattcaaac 
aacttgcaca 
tatttcccgg 
aaagaagcat 
cagtttttct 
aaactttaag 
ctgcttgtca 
aaggactaaa 
ataagttttg 



gccacggacc 
gatccaaggg 
gcccctgggc 
cgccgcctac 
cctcccctac 
ccccccactc 
gtcgcctttc 
cggctacacg 
cccgtcgggg 
gcctagggag 
ggcgcgcggc 
tctcgccg'gg 
gacccgaaag 
tatttgtggg 
aggtctcgct 
tttatttatc 
tatttgattg 
cgctttcccc 
gggtctgatt 
ttctgacagc 
aagtgcttaa. 
gccccggagt 
gcggccggtc 
tcattttaat 
tctcttgccc 
ggattaggtg 
ggactgggtt 
actgtacact 
agtttttgtt 
gcagtttggc 
agccgccttg 
aaaagacaaa 
ttttaattta 
atggaagaac 
gtgaaggaag 
ttttcttccc 
gggcactaat 
gaacctggtc 
taagtggtat 
actcttagaa 
aatgggactc 
attatctttc 
ctatgcaagt 
gcttaaatga 
tttcttaagt 
tgacccagag 
atggaaagga 
t 



atgaccatga 
aacgagctgg 
gaggtgtacc 
gagttcaacg 
ggccccgggt 
aacagcgtgt 
ctgcagcccc 
gtgcgcgagg 
tggccgccgc 
ctgcgggagc 
ccagcccggg 
cagctgaaaa 
cccgggcttc 
ggaaaacacc 
cttggcattt 
cttttaatgt 
gaaatatgtc 
tcctccccac 
tcggagttag 
cggaaatttg 
aaataatctc 
ggcgtgcggg 
caggacacag 
ttttttccct 
agtttctcct 
ggcgcttctg 
tggaagcttc 
ggtatccgga 
cagtattctg 
cccagaggat 
gagaaattta 
caatgaacaa 
gggttaaaac 
aaagtacagc 
gcactggtat 
tatttaattt 
gagactgctc 
aaatacagag 
cagtgtaata 
taataaaaaa 
tgtgtggaaa 
ctcttttgtt 
tcttcatatg 
ctctgggtat 
agttctcaaa 
ggtcatgaac 
atataataga 



4200 

4260 

4320 

4380 

4440 

4500 . 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 ' 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 
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<213> Homo Sapiens 



<4'00> 5- 

ccaagtcaga 
actccacagg 
ctcctcatgt 
caaagcacca 
gtctggaagc 
ttgtcttcca 
ctagtgacaa 



tgttccccaa 
cctaatacct 
cattagatgg 
aggcaattct 
ctttgcttta 
gcacaattcc 
ccaagaaact 



ttacctgtgg 
tggagccaga 
actcctgcat 
tcctggcttc 
gttacaatgc 
tcctccaagc 
.aagaactatg 



acaggtcagg 
aagcttccag 
ctccagaaga 
ttgggacaac 
ctatacactc 
cttactgtag 
tactcacgtt 



catattctga 
gtaaaaagtc 
ttttccacac 
cctaggcttt 
ctggaactgt 
ctacagccca 
cacctcccca 



gtctaatttc 
tgaagggggc 
caggaaagat 
ggcatgagtg 
tttgcagggc 
tcagtcctgt 
gagtcatttt 



60 
120 
180 
240 
300 
'360 
420 



ccttcaggac 
gctgctcatc 
cagggccagc 
cagggtgggg 
ttgcgggggt 
cttgtgcgtg- 
cggcgcctga 
ttcttgctgg 
gtgtaagaac 
ggtgagatgt 
cccaagataa 
tccccacgcc 
ctgggcattg 
ccactccaca 
cagggcttct 
cctgtccctg 
ttttgcatgc 
gcctagggcc 
ggcacatggc 
taaaatcagg 
taggaggatc 
atctctacta 
aactgttcag 
agctattatc 
acaaaaaaca 
ggccgaggcg 
aaaccccgtc 
ccagctactt 
tgaactgaga 
aaaaaaaaaa 
tgagttggag 
cggggccatg 
ccaccagtgg 
aatcactgtg 
ggctccaatc 
ctgtctggag 
cccaggctgt. 
agcgattctc 
acccggctaa 
cttgaactcc 
ggcgtgaacc 
tctagggact 
accgactttc 
gtcagatagg 
aatactttgt 
.caggagagcc 
tgatggggac 
tccttaccca 
gccagggcct 
ccccaagtat 
aaccttactg 
tggaaaatga 
cgggaagaag 
agtgccaggg 
gctaacttgt 
cttcccttcc 
ccttcccttc 
cactgcacag 
tgcctgctgg 
tttcccttct 
• gttttttttt 
ttggctcatt 
• tagctgggat 



aaagctcagg 
atccaacagc 
tgctcagccc 
tgggaagccc 
catccaccac 
acagcacatg 
cacttgtgcc 
agggcacaga 
cgtccttgtt 
cagtctactt 
cctcagttgt 
cccttcctga 
tactccttga 
ggctcgtcct 
ctgaaaaatg 
atgtacatat 
tctgggtgga 
tcagcactga 
caggggccag 
ctgggtgtgg 
acttgatcct 
aaaaaagaaa 
gaggctgagg 
gtgtcactgc 
aaaacaggct 
ggtggatcac 
tctactaaaa 
aggaggctga 
tcgtgccatc 
aatcagtaaa 
tgagggtttg 
acccactgca 
acagagtagt 
acatgatgtt 
tgccagaggt 
atttggccca 
agtgcagtgg 
ctgcctcaga 
tttttgtatt 
tgacctcaag 
atcgtacceg 
ggattcttgc 
ttctcccaga 
ccctagttag 
cccagaagaa 
agggttctcc 
aaatggagag 
gaacccgctg 
tacaggaaaa 
ccaaaggtct 
gggacagtgg 
gggctggagg 
tacccagatg 
caagaaggaa 
tcttgtgatg 
tggatgaccc 
tggtgatgaa 
cccttccagg 
aagaggattt 
gcaccatagg 
tgagacagag 
gcaatctctg 
tacaggtgcc 



gccttgtcac 
ttccgc'aagt 
ctgcatgctc 
accttggccg 
caagaacttt 
caggtgcttg 
ccgtcctggg 
gacagggcgc 
ctaaagaaat 
ggggctaggc 
gaccctctga 
gcagtcaact 
tcttctgctc 
gcaagatggg 
gcctctgggg 
acatagctgg 
taagggaaag 
atgtagcctt 
gaacagctga 
tggctcacat 
caggagtttg 
aaaaaaaatt 
tgggaggatc 
actccagcct 
gggcacgttg 
ctgaggtcag 
atacaaaatt 
ggcaggagaa 
gcactccagc 
atcacacc.tc 
tcctggagaa 
gggtgagagg 
acttggagcc 
taaccttgac 
acacaggcag 
aagatttttt 
ctggatcttg 
ctcccgagta 
tttagtagag 
tgatccgcct 
acccagagat 
cggaagggtg 
ctcccttggc 
aatgcgagtg 
ttcccacaag 
ctgactccgc 
gacagatgag 
ctattctgca 
aggtaaaggg 
cctggatgga 
gcaaccttgt 
ctgtgaataa 
ggaggagtac 
gatcattttg 
gctctgagct' 
aggagccctg 
cctccctatc 
attctaaagg 
caacagccat 
gacccaccct 
tctcgctctg 
cctcctgggf 
tgccaccaca 



tgggccctgc 
acactgtgaa 
aggtgcatgc 
tgacgctcag 
cgcagcaggg 
acagcatcct 
ccccgccggg 
accagcttca 
agagaaacca 
tgggtatgca 
gtatcaggca 
caccaagctc 
aatctcagct 
ccagcatgga 
cagtctttgg 
tgcccaccct 
acagaatcat 
aaggatacca 
gggactctga 
ctgcaatccc 
agaccagctt 
agccaggtgt 
g'tttgagccc 
ggggaactga 
gctcacgcct 
gagtttgaga 
agccaggcgt 
ttgctcaaac 
ctgggcaaca 
aattgcacat 
ggcagcccat 
agtggagagt 
agttctccat 
ccaagctgca 
cttcctggtg 
ttttttttga. 
gctcactgca 
gctgggacta 
atgggatttc 
gccttcacct 
ttttaactcg 
gagtgtggga 
cccactgcat 
tcacagacac 
gtttcctgta 
cctggagtcc 
ggagcagggt 
gccagcaagg 
ggaggggtgg 
gaagagcact 
cgggttagta 
agcagtggat 
taggggcagg 
tttgcagaac 
cctacctaac 
■ggctgggatg 
ttcactcaga 
aggaagagtt 
cccagtcgga 
ccactctacc 
ttttccaggc 
tcaagcaatt 
actggctaat 



caggagccgc 
ggggaagtaa 
gtatataccc 
cgcgctcaaa 
•cctcaatgac 
tgggcaggta 
catcctgcaa 
gctgaacctt 
aaccttgata 
caaattactg 
catagctggg 
atgaagaggt 
tgagaaaggt 
cacagggccc 
aaactgactg 
gaacccacca 
ttggcttctc 
cagaagcagg 
agagggactc 
agcattttgg 
gggcaacata 
ggtggtgtgc 
gggagattgc 
gtgagaccct 
gtaattctag 
ccagcctgac 
ggtggcacat 
tcgggaggtg 
agagcgaaac 
• tctgatcaca 
ttttctcctc 
ggtgcacatc 
gtctcacaca 
taaaaggcag 
ggtttctgca 
gacgaagcct 
agttctgcct 
caggcgcgtg 
accacattgg 
cccaaagtgc 
accactcact 
cagggcagcc 
cagccttact 
agctaagctc 
gaatgatctt 
ccttaagcac 
ggagcgtttt 
atgtggggct 
gaatttaagc 
ggagtaaaaa 
aaa'acaaatg, 
gtgtttgttc 
agaaatgcca 
agggagggca 
aatgagaaag 
cagtgacctc 
aaacagactt 
tctttttctg 
tgcacagcag 
actgtccata 
tggagtgcag 
ctctgcttca 
tttttgtatt 



agccgcaggg 
tgatcagaga 
tcacataggg 
gagtgcaaac 
ttcacgtgcc 
aaaggaagtg 
ggagggtggc 
gatgaagcct 
ataggttcca 
cttgcgccca 
tacctgctcc 
tgctgttgat 
caggtgtctc 
ttgaggaacc 
cctttggccc 
ctgctcctgg 
tctgctgcct 
ggcaactgaa 
tcatttaaag 
gaggctaagg 
gcaagacctc 
ctgtagtccc 
agctacagta 
gcttcaaaac 
cactttggga 
caacatggag 
gcctgtaatc 
gaggttgcag. 
tcggtctcaa 
gcaccctagt 
tgccccggca 
agtagtccag 
tagtgagaaa 
ctttaggcca 
cctgcctgtg 
cactctgtcg 
cctgggttca 
ccacaacaac 
ccaggttggt 
tgggattaca 
ccccacctca 
agggctctga 
tcctgttgac 
agcgctgacc 
gtgcctagcc 
ttaaaccatc 
agcagaatgc 
aagaactaag . 
tcattttctt 
ccccagtaca 
gtgtgggccc 
agtacaccaa 
gacagactct 
cagggatggt 
cttgctcctt- 
atttccagcc 
ggattagagg . 
tttccaaagc 
gaccatggaa 
aaaactgatg 
tggtgcgatc 
gcctcccaag 
tttagtcgag 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 • 
3060 
3120 
3180 
3240 
3300 
3360 
3420 • 
3480 ' 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
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acggggtttc 
ctcggctttc 
tgtttttttc. 
tttttttttt 
cccagtctcg 
ctccccagta 
tcagtacaga 
gatctgcctg 
ccttctttct 
acgatcttgg 
ctgggactac 
ggtttcacca 
tggcctccca 
ttttgagaca 
actgcaacct 
ggattacagg 
ttttcccacc 
. ggggtctaca 
agcaccaccc 
cactttcttt 
agtccctgtt 
•atggcatctg 
. acagtcctgg 
gctttccccc 
tttcctatct 
ccacattcca 
catacaggag 
tacttctcaa 
gagggagcag 
tttagtcata 
ccgtgttcaa 
tggaatcggg 
ccacgtagcc 
ggcggagctc 
tcccggcgcg 
tccgagtcct 
gaaggcgcct 
agccgccaac 
cccgggccag 
cttgtctcat 
acctcatcgc 
gccagcgtcc 
cgggaagtgc 
ggagtacttg 
ctctgtgccg 
ctcgagaatg 
gcgctcacca 
caacagtcca 
gagaggcctg 
ggaagcccca 
ggttacctca 
cggaaggcca 
cttccccttt 
tcacccaggc 
tcaagcgatt 
•gcccagctaa 
ctcaaactcc 
ggcttgagcc 
tacttgaagg 
ctactcccct 
.ccctctaccc 
ctcagcgatc 
cagcatcatt 



accattttgg 
caaagtgctg 
ttttttttta 
ttttttttga 
actcactgca 
gctgggatta 
cggggttttg 
ccttggcctc 
tttttttgag 
ctcactgcaa 
agtcacacac 
tgttggccag 
aagcattggc 
gagtctcact 
ctgcctccca 
tgtgcacctc 
tttccacagt 
ctcaactcaa 
attttgtaag 
acttaaataa 
tctttttcct 
gcacacagtt 
gttagtaacc 
ttcctgtggg 
attcccaaat 
gcagctggtg 
tgcagatgga 
ttcctggcga 
gagcattcga 
gctggataat 
cctgtctgtg 
gtcttacgca 
aggagggtgg 
cagcgaccgc 
gcctgcgagc 
cttggctgca 
ccgcctcgcc 
caccgccccg 
agccgcgccg 
tggggcagga 
tccggagctc 
gggcaagcgc 
gcgtgcgcgg 
cggagccggc 
ccgggaaatc 
cctgcactgt 
cgttcgtgtc 
ggcagacgag 
gcggtgaggc 
gggagagggc 
cactgctacg 
caggcagaga 
cctcattggc 
tggagtgcaa 
cttctgcctc 
tttttgtatt 
tgatctcagg 
accgcacccg 
aaaaactaaa 
cagaccaaga 
actccagcca 
tgtgtactac 
tgaagcttgc 



ccaggctggt 
ggattaaagg 
acatataact 
gacagagtct 
acctctgtct 
caggcacaca 
ctatgttggc 
ccaaaatgct 
acagagcctc 
cctttgcctc 
caccatgccc 
gctgacctcg 
attagaggtg 
ctgtcactca 
ggttcaagtg 
cgtggctagc 
ggaaaatggc 
cccttgcacc 
gaattttgta 
ggatttccgc 
■ggtgcctagg 
ctccgataat 
tggatctctt 
taaagtgctc 
ctttctccag 



acaggccaga 
ggagcaggat 
ggactcttcc 
acgcggaaat 
gcccggctca 
acagaaacca 
cggttccgcg 
ggctgcccac 
ttcccctccc 
tagcgaggtt 
gtagccactg 
catagccgta 
gtcgcgtgcg 
caaccgttaa 
acgccggggc 
cactcacaga 
acaagagtgg 
agcctgggtc 
aatccaggct 
ggcaattaga 
ggcctgccca 
ccgctccacc 
cgcgcggcag 
ggaggagctc 
cgctgctcgc 
cggactctaa 
ggcctgctca 
aattaaaaaa 
tggcgcgatt 
agcctcctga 
tttagtagag 
tgatccaccc 
gcccttcact- 
cacctttaac 
cactggtctc 
ggcttgcctc 
caaatggaat 
tccctttgac 



cttgaactcc 
tgtgagccac 
tgggacttct 
tgctctctca 
tetgggttca 
ccaccatggc 
ctggcaggtc 
gagattacag 
accctgtcac 
ccggttgaag 
ggctaatttt 
aattcctgat 
tgagccaccg 
ggctggagtg 
atccttctgc 
cctccttttc 
tcaggactga 
caagctctgg 
aaattttatc 
cccatttctg 
cctggttctc 
ggggatacct 
tttccacctc 
ctgaatgctt 
atccacccta 
acagggaaga 
ctctctctgc 
ctctccacat 
cgaggtgcta 
ggtctaccac 

agggggcccc 
ggcaggtccc 
cgacccagga 
gcccgccggc 
cgcgcggtga 
ctcgtcgtgc • 
cccgcccgtc 
tgcgtgtacg 
gactgaaacg 

ggggacacgc 

ccccacctac 
cctctggccg 
agcctgggcc 
cccctcccag 
acgctccttg 
tcctcgccct 
gcgggttccc 
cggtagtggc 
caggtcgggg 
caggctccgc 
tgttggccac 
acagttggat 
acaajcaacaa 
tcggctcacc 
gtacctagga 
acggggtttc 
gcctcggcct 
. gggaacgtat 
ctacgtctgc 
tatacactct 
ttcctccagg 
ccagtgttcc 
tcccaggggc 



tgacctcatg 
tgcacctggc 
cagcctccta 
tccaggctgg 
agtgatactc 
cagataattt 
tcgaactctt 
gcatgagtca 
ccaggttgga 
tgattcagtc 
tgtatgttta 
tgcaaatgat 
tacttggctt 
cagtggcacg 
ctcaccctcc 
aattggttag 
ctgacatgaa 
gctaagattt 
tgaagcatca 
ccaggcatac 
tgagcctggt 
aggaggttcc 
tttaggcatt 
atggtccaaa 
gaggaaggga 
ggtgagggct 
ctctcaagtt 
cctcccctag 
gtccaaactg 
aagccataca 
ggcacccagc" 
cggccaggac 
cgcggcaacg 
accccctggc 
agtactgctc 
• tgctccaggt 
ccccagtcct 
cgtgtcagtg 
tagatcgccg 
acgcttcgcc 
cacagggaac 
gaggcgaggg 
cgggtccgct 
cccccgcgca 
cgcgcggcac 
tcccatacgc 
agcccaggtc 
aggtgaactt 
aaatgtcccg 
aggcccgacc 
ctgggcgtct 
ctctatcgcc 
aaaactgcgt 
gcaacctccg 
ttacaggcgc 
accatgttag 
tccaaagtgc 
atggaataca 
cctgtggttg 
aatccttcgc 
aaactgcccg 
attctccatt 
tacactctcc 



atccacccac 
ctaaaactga 
ttctttcttt 
aatgcagtgg; 
ctgcctcagc 
ttttgtattt 
ggcctcaagt 
ccaagcccag 
gtgcagtggc 
tcccaagtag 
gtagagatag 
ccacctgcct 
ccttttctat 
atcttggctc 
caagtagctg 
tgtcttgtgg 
gacaagccca 
tggcgtgctg 
ctcacaactc 
tgagcttcac 
ggtcacacca 
gagacacctt 
ttataatcta 
acaagacttc 
acagaatctt 
cagctggctc 
ttcctaaaca 
tctccccaag. 
ctcggtcggc 
gctgcttttt 
atctaggcgg 
ccgcggggag 
gaccggggag 
tcccacctgg 
gagctccgag 
catttcgaaa 
gcgcgtccgt 
tgcgcgtgcg 
ggatctagct 
cccaggaatg 
.gggggcgggt 
cgggaaggtg • 
tgcagcgggt 
gaattagcct 
ccaggcagcc 
cctcggcccc 
ccggggcccg 
gcaatctgca 
gagattgaag 
tatctcagtg 
ggaaaccggc 
tagcacagaa 
cttgcttttg 
cctcctgggt 
ccgccaccat 
ccaggctggt 
• tgggactaca 
tctgcccatt 
tcacctgtct 
cttcactctc' 
ggacagggtc 
ctcaccccct 
cagttttcct 



4260 
4320 
4380 
•4440 
4500 
4560 
4620 
4680. 
4740' 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
; 5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 ' 
.6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660. 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
. 7140 
7200 
7260 
7320 
• 7380 
7440 
7500 
• 7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 



o* • ••• • 



cctaccccct 
acgatgaagt 
agatccacgg 
cactaggatg 
cagcctcact 
cagttactgg 
taaacatgcc 
cgctgcaatg 
actctgtcgc 
ctgtgttcaa 
cactacgcct 
caggctggtc 
gggattacag 
.actttgtgct 
atgcacacac 
atgctcaacc 
tagggtccac 
ggactgtaaa 
gcttgbtggc 
ttagcatctg 
cctgcttgca 
taatttccct 
ctgggaagat 
ttttaaagat 
atctgtggcc 
agcgatgggg 
ccgcgacttg 
gcgcggggcc 
tcgccacaga 
cggccaggga 
ttggcacgct 
* .agctcaatga 
gggcgctagc 
cccaggatcc 
gtgcagggac 
cttgcccttc 
cggcgaccgc 
tagaggaaga 
acacttggcc 
gagggagatc 
ccggcagagg 
gcatgggtgg . 
caggacacaa 
tgggtattga 
ttctaggata 
' aagcctctgc 
■ ccagcctgta 
ggtgggggca 
ccacaggcct 
ctggagctgg 
ccagatttct 



gcagctcctg 
ctgggctcag 
ctgaaacact 
tctaaaagac 
gctacacttg . 
agacaaaagc 
caaatcatcc 
tcccagcctt 
caaggctgga 
gcaattctcc 
ggctcatttt 
ttgaactcct 
gcatgagcca 
ccccccagct 
agacctggga 
cagactccat 
tcccagcagt 
agcacccctg 
gattcccagg 
attcactggg 

ggggg cccac 

tcaccctaaa 
ggactccgag 
aagaaactta 
cagatacgag 
cgaaagtaac 
acccgcggcg 
actactcacg 
ggtc<gcacca 
ccagctgccg 
ccagccgggt 
gctcaggctc 
aagcgcgggc 
agactgggcg 
tgtggggccc 
cttccctcct 
aaagcaccac 
gggtccccac 
tacccactgg 
ggtcagcagc 
ctggaccgtg 
tcggccctca 
gtctttccat 
ggatggcaat 
ccctcacctg 
actgcagtaa 
ggtctcagcc 
ccccacccac 
taccttcgcg 
tgcttggcaa 
gggatctagg 



ctcagctcct 
tcctgatcac 
gacctaaacc 
gtttcaagtg 
cccagcttcc 
ccaaacttat 
tgcttcttat 
gttctttttt 
gggcagtggc 
cacctcagcc 
tttttatttt 
gacctcaggt 
ccgcgctcgg 
gaatttgtga 
cactatccat 
gttacaataa 
tgattcaaca 
gtgattccag 
ggtgcagaag 
tcagaaaagg 
caaaggttct 
ggttctggag 
gacagtaggc 
aaatcagcgt 
tggagtgcga 
ggacctagtc 
actgcgctgc 
cgcgcactgc 
cgtgtgcgtg 
tgtggggttg 
gcggcccttc 
ccccgacatg 
cgggcggggc" 
gcgggaagga 
gaaggcgggg 
tcgtcccctc 
gcggagatac 
atccggccct 
gtggggcagg 
tttaccgccc 
atcttgaggt 
gcaactgcag 
gcttttccca 
gcatgtgggg 
cagcaatacc 
gcacagcggg 
cacctggggg 
atccatgcct 
cctgcagccg 
ttgcctgcca 



ttgcagattc 
tggcctggtc 
ctcagactag 
aacatggcca 
tctttgcagc 
ctttgatttc 
ctccatggct 
tttttttttt 
acgatctcgg 
tcccgagtag 
ttagtagaca 
gatccacccg 
ccccttgttc 
tgtcctcttg 
ccacaagttc 
tttggggagt 
ggtctgcggt 
ctggtgtcta 
gactgctggg 
gtgtttgcta 
aaatttttcc 
ggggtcatga 
acaaaccctt* 
atttttacat 
caagggataa 
ctcgggagct 
cccttggctg 
aggcctttgc 
gcgggccccg 
cacgcggtgc 
ccagcgcgcc 
gcccggttgg 
cacagggcgg 
gctgaggaga 
ctgggcgcgc 
ctcacacccc 
ccgtgtttct 
ggccctcctg 
aagtctcgag 
gctctgctct 
tcaggggtgc 
ccctcatttg 
gtgcttgact 
gatgtgggag 
actcattctg 
gccgctctgg 
aaagtcagga 
cctgcatccc 
caggtcctgb 
bttgcctctg 



tgactcaact 
tgtctacatt 
atcctccgtg 
aaatttaatt 
aaaaatggcc 
tcccttgtct 
actttatttc 
tttttttgag 
ctcactgcaa 
ctgggattac 
tgaggtttca 
cctccgcctt 
attctttgca 
taccggatga 
ctaaataggc 
ttttaaaatt 
gggatccagg 
cccaggggag 
tgtgtggctg 
aataaagact 
aggctccctc 
gtgtttgaga 
tctcaagaag 
ataagcagcc 
accattttcg 
gtccccgccg 
ccccttccgc 
gcacgacgcc 
cgggctggaa 
cccgcgcgat 
cagcgggtgc 
gcccgtgctt 
gccccgactt 
gccgcgcaat 
tctcgcagag 
accccggacg 
ggaggccagc 
gtccggtttg 
ccttcacttg 
ccactgcgga 
.attctgggtg 
gctctgtcac 
tggcactccc 
tagggcttag 
gcatcgtgag 
agccactgcc 
aggtctgact 
ctccaccctc 
tctgaggggc 
ttttctcgct 



tccatatctc 
catctgcccc 
ccagtacctt 
cccttttctt 
actaggctcc 
ctacctctga 
tctttgagaa 
acagagtctc 
cctccgcctc 
aggcacccgc 
ccatgttggc 
ccgaagtgct 
ttctgtcaca 
gagggtctcc 
cagagcagtg • 
tactgatgcc 
ctagcgggga 
agcaaccttt 
cgtgcatatt.. 
caacaaaact 
ccataggtgg 
agaggcaagc 
ggccaaggca 
acctctgctc 
cgcactcttc 
accccctctg 
tctcgtaggc 
ccagatgaag 
gcggtggcca 
gcgcagcgcg 
cagctcccgc 
cgctggcttt . 
cagcgcctcc 
ggaaacctgg 
c.cccccccgc 
gccacaacga 
tttactgtgc 
ctgaagcaac 
gggtgaggag 
gactggggct 
gattcccttg 
cctgggctgc 
tgcaggcagg 
aggtccaagg 
cagcgcttag 
tctagcacat 
ggccctggaa 
cctgccattt 
tgaacacatg 
ccagccgctc 



8040 
8100 
8160 
8220- 
8280 
8340 
8400 
8460 
8520 
858.0 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
' 9660 
9720 
9780 
9840 
.9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
-10680 
10740 
10800 
10860 
10920 
.10980 
11001 



<210> 6 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 6 

gcggggttgg taggggcgtt gttttggtat • agttcggggt ttggtagcgg cgggtggggt 



60 



atcggttaag agttgttatc gtcgcgggga ggggagttcg gttcgtcggg atcgtaggta 
acgjgtcgcg gggt?tcgcg ggttaggagg ggaacggggt cgggcgggcg agtagcgggt 
aaSSrttt agggttcggt ttcgggtttt gtcgtcggat ttgggggtcg cgaggaagag 
KqcSgtcg agggtttggg gtcggcgtat ttttttcgtt ttgtttgtag ttggaaaatt 
tt?t?t?aag t??ggggcgg cggagtttcg ggggagaagg ggtcggggga gt cgcggagg 
gaggcg^cgg gttcgcgcgt gtagggttta ggtcgaggtc gggacgcggg tggggcgtag 
X?cgggt?a gggtcgtagt cggttgtgcg tcgtgttcgt tcggggcgtt gttttttttt 
ftttllllll gttgcgtggt tttttttttt tttttatttg ttttttgttt tagttttttg 
t"cga?a?a Lg?t?tttt cgcgtcgggt tcggttttcg cgttttgttc gt tacggtag 
tlltlatttt cqtttttcgc gcggtcgtcg ttcgggtttc gatcgagggt tgatagtttt 
cgfttfgggc gfcgkggg cgggtatcgc gttttttttt ttcgtattat tttttttaat 
tggggtaatt tttttcgagg cgggaggcgt tggtttttcg gttttttttt "tttatttg 
aataaaqttt ttcgttttga atgatttttt ttgaagcgga tattttattt aaatcgggta 
SSSttti aaagggttat tgcgtttgaa tagttttttt ttcggaagtt ttagtattta 
attlaaStt ttggggcgtg taggtcgttt tggttttttt tttatcggcg gtcgtttatt 
?tttg?t?" Ittlllggtl cgggcgggtc ggtttgggtt tttattttag agggtagtcg 
qttt?tcgtc ggtgtt?agg tcgt.agggtt gatgttttcg tttagttgag ^^aggggaa. 
Sggagggga gaagtgtcgg gttggggtta ggcggttagg gcgtcgtacg ^ttttattc 
aq?cgg?gtg tgttttcgta ggagagtgtg ttgggtagac gatgttggat acgatggagg 
rltZllalll ttttaggtag ttgttgttgt agtttaataa ttagcgtatt aagggttttt 
tSgcgacg? ga^ta?cgtg gtgtagaacg ttttttttcg cgcgtataag -cgtgttgg 

agg?ttgt?a ?tcgttttta gtcgggtttt cgtcgtcgtt tgtcgcggag tcgttttcgg 
S?tagaggt cgcggttaat acgtattgcg tcgagttgta cgcgtcggga "cggttcgg . 
?cqtcgtatt ttgtgtttcg gagcgtcgtt gttttttttt ttgtggtttg g^tttgttta 
agaagagttc gtcgggtttc gcggcgttag agcggtcgtt ggttgagcgc Jjgttgtttt 
.Sgcg^tcgga Jagttttttt agcgtcggtt tcgtcgttta taaggagtcg "tttcgttt 
tatcatcgtt gtcgtcgttg tttttttaga agttggagga ggtcgtatcg tttttcgatt 
tatt?cgcgg cggLgcgg? agttcgggat tcgagttttt cggtcgtttc g^gggttta 
attt«?t?a tcgttgga?g aagtacgagt cgggtttggg tagttatggc gacgagttgg 
gtcgggagcg cggttStt? agcgagcgtt gcgaagagcg tggtggggac gcggtcgttt 
cgttcggggg gttttcgttc ggtttggcgt cgtcgtcgcg ttattttggt ^gtttggacg 
SStcqgcgc gggcggcgac ggcgacgatt ataagagtag tagcgaggag atcggtagta 
acaaqqat?t Kgttcgttt ggcggttatt tcgagggtta tttatgttcg tatttggttt 
Sqqcqaqtt cglgag?ttc ggtgataatt tgtacgtgtg tatttcgtgo ggtaagggtt 
JtttSSS ?gagtagttg aacgcgtacg tggaggttta cgtggaggag 9'99"«°£ 
tatacaJtag ggtcgaggcg gtcgaagtgg tcgttggggt cgtcggttta 999ttttttt 

S ■ S25S5 £3SS S355 3gjS 
■S 3SESS S=F " S . 
S 55 SSSS 55555 SSSS 53553 ffi J 

t?a??ag?ta Ltgaagatg tacgtcgtgg ggggcgcggt cggcgcggtc 9999cgttgg 
cqqqtttggg ggggtttttc ggcgttttcg gtttcgacgg taagggtaag ttcgattttt 
tcaaaaaS? tt?tgttgtg gttcgtttta cggtcgagta gttgagtttg aagtagtagg 
Sagged cgcggtcgag ttgttggcgt agattacgta ttttttgtac gttttjjgg 
tar,r.n?taaa qactttttat tcgttggtta agtttacggt cgagttgggt tttagtttcg 
SSSSS cglggtgttg ag?ta^gcg tttatttgge 99tcgggttc ? acggtcgga 
ttatcgatcg tttttttttt atttagagcg tttttcgtta ^tcgttttg tcg^tgttgc 
gcggt?ttgg ttcgtatttt agggagcggc. gggggcggcg cgtagggttt ^tgtgttcg 
ggataatcgt agcgtcgtta .tagtggcggt tttatttttc ggcggtttta tttggtttta 
ttgtttcgtg ttttagttcg g 



<210> 7 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 7 

tcgagttaag gtacgaagta gtgaggttag gtgaggtcgt cgagaggtgg agtcgttatt 
gtggcgacgt tgcggttgtt tcgggtatag tgggttttgc gcgtcgtttt cgtcgttttt 
tggggtgcgg gttagggtcg cgtagtagcg atagagcggg ttggcgaggg gcgttttagg 
tgggagagaa acggtcgatg gttcggtcgt cgggttcggt cgttaggtga gcgttttggt 
ttagtatttc ggtcgttttg tcggggttga ggtttagttc ggtcgtgaat ttggttagcg 
ggtagaggtt ttttagcgtt attttggggt cgtgtaggaa gtgcgtggtt tgcgttagta 
gttcggtcgc ggtcgttttg ttttgttgtt ttaggtttag ttgttcggtc gtgaggcgag. 
ttatagtaaa gacgtt.ttcg gggaagtcga gtttgttttt gtcgtcgggg tcggggacgt. 
cggggagttt ttttaagttc gttagcgttt cggtcgcgtc ggtcgcgttt tttacggcgt 
gtatttttat gtggttgatg aggttgcgtt gttgtgcgaa tttgtcgtcg tatatttggt 
attcgtaggg tttttcgttc gagtggatgc gtatgtgttt cgtgaggcgg tattggcgcg 
tgaatcgtat gtcgtacgcg tcgtacgcga agggtttgag gtttaggtgg ttgcgtatgt 
ggcgcgttat ggttttacgt tgcgtgaatt ttttttcgta gatggtgtat gggtagggtc 
gggttagtta gtgcgttttt tcgtgttgtc gtagcgtggt cgggtttttg tagtttttgt 
cgtacgacgc gtagcggtag ggtcgtagta .gtttttttag' gttattcgga gtttcggcga 
ttttgttttc gtcgttttta aaagggggtt ttaggtcggc ggttttagcg gttatttcgg 
tcgtttcggt tttgtcgtat agcgtttttt ttttttttac gtgagttttt acgtgcgcgt 
ttagttgttt agagttgggg aagtttttgt cgtacggaat gtatacgtat aggttgttat 
cgaagttttc gggttcgtta taggttaggt gcgggtatgg gtagttttcg aggtggtcgt 
taggcgggtt ggggttttcg ttgttatcgg ttttttcgtt gttgtttttg tagtcgtcgt 
cgtcgtcgtt cgcgtcgggt tcgt.ttaggt tgttagggta gcgcggcggc ggcgttaggt 
cgagcggggg tttttcgggc gagacggtcg cgtttttatt acgtttttcg" tagcgttcgt 
tgggggagtc gcgttttcgg tttagttcgt cgttatagtt atttaggttc ggttcgtgtt 
ttatttagcg atagaggaga ttaggttcgt cggggcggtc ggggggttcg ggtttcgggt 
tgtcgttgtc gtcgcgaaat gggtcggaag gcggtgcggt tttttttagt ttttggaagg 
gtagcggcgg tag'cgacggt agggcgagag gcggtttttt gtaggcggcg gggtcggcgt 
tgggagggtt gttcgggcgc gggggtagtt cgcgtttagt tagcggtcgt tttggcgtcg 
cggagttcgg cgggtttttt ttggataggt tt'aggttata aagaggggag tagcggcgtt 
tcgaggtata gagtgcggcg gtcgggtcgg gtttcgacgc gtatagttcg gcgtagtgcg 
tgttgatcgc ggtttttggg ttcgagggcg gtttcgcggt aggcggcggc ggaggttcga 
ttggggacgg gtagtaggtt tggatgatcg gcgtggcggt tcgtaggttt cggttcggtc 
gattataggg cgcgtagtcg tcgtcgtcgt cgtcgtcgtt tcgtaggtgg tagtatttgt 
cgtggcgttt gaggcgtttt ttgtatagcg- ttacgaggtc ggggatttgt aggtagttgg 
cggcggtt'ag tacggcgttt aggttcggtt tagttttcgg ggttacggtc gcggtcgtag 
tcgtttttgc gtcgttagtt aggcggtcgg tgtagatgaa gtttagtatt aggcggaata- 
cggtcgggtt tattatgtta tggtttaggt tgagtaggtt gttatgtatt attagggatt 
tgaggtaggc gttgttggtc gttagtacgt ttttgtgcgc gcggaagagg gcgttttgta 
.ttacgatgat tacgtcgtat aagaagtttt tggtgcgttg gttgttgagt tgtagtagta 
gttgtttgga gtggtcgggc gtttttatcg tgtttagtat cgtttgttta gtatattttt 
ttgcggggat atatatcggt cgggtgagag tcgtgcggcg ttttggtcgt ttggttttag 
ttcggtattt ttttttttta tttttttttt ttttagttga gcgggggtat ' tagttttgcg 
gtttgggtat cggcgaagga tcggttgttt tttggagtgg gagtttaggt cggttcgttc 
ggattaggag aaggagtagg aggtgagcgg tcgtcggtgg aggggaggtt agggcggttt 
gtacgtttta gggtatttgg "ttgggtgttg gggttttcga gaagaaaatt gtttaggcgt 
agtgattttt ttggagatag ttattcgatt taagtaaaat gttcgtttta ggaaaagtta 
tttagggcgg agaattttat ttaagtaggg agaaagggag tcgaggaatt agcgtttttc 
gtttcgggag aagttgtttt agttggggga agtgatacgg aggaggggag cgcggtgttc 
gttttggcgt cgttttggtc gggggttgtt aattttcggt cggggttcgg gcggcggtcg' 
cgcggggagc ggaggtagcg gttgtcgtgg cgggtagagc gcgaaggtcg ggttcggcgc ' 
.ggggagggcg ttatatcggg gtaggaggtt gaggtaggaa gtaggtgggg gggagggggg 
agttacgtag tttttagggg agggaggggg tagcgtttcg ggcgggtacg gcgtatagtc 
ggttgcggtt ttgattcggg tttgcgtttt attcgcgttt cggtttcggt ttgggtttta 
tacgcgcggg. ttcggcgttt • tttttcgcgg- ttttttcggt tttttttttt tcggaatttc 
gtcgttttaa atttggggaa aagtttttta attgtagata gggcgggagg agtgcgtcgg 
ttttaggttt tcggttcgta gttttttttc gcggttttta aattcggcgg tagagttcgg 
agtcgagttt tgagtttttt tgttcgttgt tcgttcgttc gatttcgttt tttttttggt 
tcgcggggtt tcgcggttcg ttatttgcgg tttcggcggg tcgggttttt tttttcgcgg 



cggtggtagt ttttagtcga tgttttattc gtcgttgtta ggtttcgagt tgtgttaggg 
tagcgttttt gttagtttcg t 

<210> 8 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 

<223> chemically treated genomic DNA (Homo sapiens) 



3480 
3501 



<400> 8 





tttttatagt 
tagtataaaa 
tttttttttt 
gggtcgttat 
ggatagggtt 
ggttgtatcg 
atttaggaga 
agattattta 
aagtagtggg 
gtagggatta 
gttatttttg 
aaatttagaa 
gtgtcgagtt 
aaatggtttt 
ggattttaag 
gattttcgtt 
gttttgtttg 
cgacgggcgg 
tcgcgttttt 
gtagttgagg' 
attttttgag 
tttttagtat 
ttgttgattg 
tatttttttt 
cgcgaggggc 
ttattcggtc 
tttttttaaa 
gttataattt 
ataggggtta 
ttggagagta 
gcggtaaaat 
aataaattat 
agtgggtgat 
ttattggtag 
taaacgttat 
taaagtggtt 
attttatatt 
ttttttaaat 
attttataac 
gaggtttagg 
ggggttatat 
tttatatatg 



gtaaatgtgt 
tttaaatatt 
taaatttagt 
tttaggtttt 
tgtttatttt 
ttattgaaag 
t'gtttagtaa 
ggaatatttg 
ggtttaggat 
ttatattgat 
gaagtttttt 
aacgtttatt 
aattattttt 
tggaaggtaa 
ggagtcgcgc 
tcgttttcgt 
ttcgtgtgtg 
gtggtattgg 
tgttgaggcg 
ttatttcgtt 
gaggaaagag 
ttcggtgtgg 
taaaattggg 
gtaattaatt 

ggggg.ttgga 

gtttagtgtt 
gagggcgttt 
gtt.ttcgaaa 
ggcgtatgat 
gagaatttcg 
ggaaaattgt 
ttatattaat 
ttttgtaatt 
tattttatat 
aaattggaat 
ttttatgttg 
tatagttgtt 
gtttttagga 
gtttggtgtt 
"ga.ttatattt 
ttgaaaatta 
gtatgtagta 



ttttattatt 
taaatatgat 
tagtgtttgt 
ggtttttttt 
gatattatta 
tttattattt 
atgttgattg 
ttaaaatgtt 
tttgtatttt 
aaatttttat 
aaggataggg 
gagtgaatga 
ttatataagt 
gttttttagc 
gttttgcgtt 
tatttttcgt 
tgtcgttgta 
ttatcggcgc 
tagttggttg 
gaggtggtgg 
tcggaaatat 
gttttttaag 
tgaaagtttt 
tttttagatt 
attcgttttt 
tgtttttttt 
gataagagtt 
tttattggag 
ggagaaagaa 
aagtggatcg 
tatgtgaaat 
atagatgttg 
tttaagtgta 
ttggaaaggt 
tttgttaatt 
ttttttagaa 
tggttatttt 
aaatattttg. 
ttaaattttt 
taagacgttt 
tttaaggata 
tattttatat 



ttttggagta 
tttgttggtt 
gttgtttgta 
tagaaagttt 
gtttatttat 
gattttgagt 
aattatgatt 
tttgtttttt 
gatagttttt 
tatttttaag 
gtcgtatgtt 
tgaaacgata 
tttagtagtt 
gagtggagtt 
tattttttta 
ttttttttgt 
ggttttattt 
gggggtaggt 
tttttttcgg 
ggaggggaat 
ttggtttttt 
gttttgtttg 
ttttgatttt 
tttgtttagt 
tgaagcgtag 
tgtcgtaaga 
ttttttcgtt 
ttttggtaga 
aatgggagta 
gttagtggga 
agttttattt 
tgtagtgaga 
gggattttga 
tttaatgtat 
aatttttat'g 
aatgttttta 
tgtatgggta 
tttatatttt 
tttgatagt-t 
ttgttatcgt 
gggattgttt 
tagtatttag 



attttattta 
ttgtttttgt 
atgttttttt 
tttttttttt 
atatattggt 
agtttgagga 
gtttttaata 
gagattttat 
tgatgtttgt 
ggggattttt 
gtttttttag 
ggtgaataga 
tttattgttt 
atttttaatt 
ttagaaatcg 
ttcgtagtcg 
atgggtttat 
gagtatgcga 
gtcggtatac 
ggttattttt 
aagtaggtat 
agaggagagg 
tatttgtggg 
tttttaaagg 
agatgttagt 
ttttaagttc 
ggagtttgta 
ggttgtaagt 
ggatgggtat 
aagttgtttg 
tttaaagtat 
ttttatatta 
tattatgtat 
gaattatttg 
tatttttata 
gtgttgatga 
tgtatttggg 
gtttttattg 
tttggataat 
taaggagatt. 
tttttgatat 



aaatcgtttt 
ggttttattt 
ttttttgtag 
tttttagcgg 
tataagttta 
ttttattaaa 
tataaacgta 
tttgggaggt 
atgtagaagt 
ttttttaggg 
gttagtaatt 
tgaacgtaag 
ttagtcgtag 
atatttttta 
gtaagttatt 
gcgtttagcg 
cgttgaggtt 
aggttggagg 
gcgcgtagtc 
gaggtatcgt 
agttcgtttt 
ttaggttggg 
tatcgattgt 
attgaaaagt 
ttttgaaaag ' 
gtgagaggat 
tgtttagtaa' 
ttaaatgcgt 
atttgaggaa 
tattttagga 
aaaaaattaa 
gttttttatt 
ttttgatttt 
agttatatat 
tttgtattga 
atagttaagt 
tgtagttata 
taaataatgt 
ttttatgtag 
ttttttttta 
tattagtata 



60 
120 
180 
240 
300 . 
.360 * 
420 
480- 
540 
600 
660 
720 
780 
'840 
900 
960 • 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
168.0. 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
.2280 
2340 
2400 
2460 
2501 



<210> 9 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 



<223> chemically treated genomic DNA (Homo- sapiens) 



<400> 9 



attgagtatt 
gaagtagttt 
agcgatggta 
gagttgttaa 
taaagtgtaa 
atgtttatgt 
ttgaaagtat 
atataggaat 
tatgtattag 
gttaagattt 
attttattgt 
aaaatggggt 
tttttattgg 
ttatttttat 
ttttgttaag 
taacggggaa 
attttacgat 
tttgcgtttt 
gattaggtaa 
aagggttagg 
ttaggtagga 
tgagagatta 
tattttt-ttt 
ttcgaaaaga 
tatttgtttt 
tgaataaggt 
gataggagga 
gtaggaagat 
tgattttatt 
gagttgttag 
ttgtcgtttt 
tagtatgcgg 
ttttagggat 
aggggattgt 
taaggggtaa 
tagttatagt 
tatttagagt 
tgtaagtgag 
aaaatttttt 
ttataggtag 
aagttagtaa 
ttattttaga 



ggtgtaaaat 
ttatttttga 
aaaacgtttt 
aaaagattta 
ataaaatgtt 
agagataatt 
tttttgaaag 
taattgatag 
ggttttttta 
ttgtatttgg 
atagtattta 
tattttatat 
tcggtttatt 
tttttttttt 
attttagtgg 
gggtttttat 
agagggaaat 
agggggcgaa 
agatttaaga 
gagaattttt 
ttttgggaag 
ggtgttttcg 
tttattattt 
gtagttagtt 
cgcgtcggtg 
ttgtaacgat 
ggcggggaat 
gagcgtagag 
cgttggaaga 
gatttgtgta 
attatttatt 
tttttatttt 
ggtaaagatt 
taaaatgtag 
aaatattttg 
ttaattaata 
tagatggtaa 
ttaatggtgt 
gaaggaggat 
tataaatatt 
gattatgttt 
gagtggtggg 



gtattgtata 
atgattttta 
agaatgtaat 
aaatattaag 
tttttaaaaa 
agatagttat 
gtagtataag 
aattttagtt 
ggtataaggt 
aggttataga 
tgttggtatg 
aataattttt 
tcggaatttt 
tattatacgt 
atttcgagaa 
tagacgtttt 
aaatattgaa 
ttttagtttt 
gagttaattg 
atttagtttt 
tttatatcgg. 
gttttttttt 
tagcggggtg 
gcgttttagt 
attagtatta- 
atatatacga 
ggcgggggcg 
cgcgcggttt 
tttgtttttt 
gaagaatagt 
taataaacgt 
tgaggggttt 
tgttagtgta 
agttttggat 
ataagtgttt 
tttattgagt 
atttttaata 
taaaataaat 
taggatttaa 
agttaggttt 
agatgtttga 
aatatattta 



ttatgtgtaa 
gatatagttt 
ttttggattt 
cgttgtaaaa 
tatttagaag 
gggtataaaa 
aagttatttt 
tgtaacgttt 
attattagtg 
ggttatttat 
aatggtttgt 
tattttgtcg 
ttgtttttta 
ttgatttttg 
tagattgtga 
ttttaaagaa 
cgatcgaatg 
cgtttttcgc 
taggaagagt 
gtaattagta 
ggtgttgggg 
ttttaagaga 
attttagttg 
aaggggcgcg 
ttcgttcgtc 
ataagtagag 
ggacgagggt 
ttttggaatt 
aggggttatt 
taattcgata 
tttttgggtt 
ttagagatag 
ataatttttg 
ttttattgtt 
ttaaatgatt 
attttttgag 
acggtgtagt 
agattttatt 
agtggcgatt 
ggggagaaag 
attttgtgtt 
tattatggaa 



gtatgttaat 
ttgaaaagga 
tttgtatgaa 
tatattattt 
gtatgattat 
tatttggtta 
attaatatag 
aatatataat 
gagaattaaa 
tggtgagaaa 
tttaattttt 
tttttgaaat 
gttttttaga 
tgcgtatttg 
tttgttaagt 
ggttttttta 
attttttagg 
ggttttttag 
aataatcgat 
gtttagtttg 
agaagcgggt 
tgcggtgttt 
cggttgcgcg 
gtttttaatt 
gaattttagc 
tcgtt'ggacg 
tagtgatttg 
ttgggaaatg 
tttgcggttg 
ttttgcgttt 
tagttgttga 
tttttgggaa 
tatttttata 
tattttttaa 
ttgcgtttgt 
ttttgatagg 
ttagatttgt 
ttcgttgggg 
tttgtaaaga 
agggtaaagt 
gaaaacggtt 



ggtgttaaaa 
aagttttttf 
aattatttaa 
ataataaaag 
atttaaatat 
tttattaata 
atatgaaagt 
ttaaataatt 
ggtgtataat 
ttaatgtaaa 
tgtattttaa 
ataggtaatt 
tgtgtttatt 
agtttataat 
atataaattt' 
■ cgaatttaaa 
agttgatatt 
ttttttaaaa 
gtttatagat 
gttttttttt 
tgtatttgtt 
taagaataat 
cgtatgtcgg 
ttcgtatgtt 
ggtgagttta 
tcgattgcgg 
tcgatttttg 
tagttaagag 
gaagtaatgg 
atttatttat 
tttagagaaa 
ggaaagtttt 
tgtaaatatt 
aatagaattt 
atgttgagaa 
atttttaagt 
gattaatgtg 
agaaagagga 
aagaagggta 
tataaaagta 
ttaagtagga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 . 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2501 



<210> 10 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 

<223> chemically treated genomic DNA -( Homo sapiens) 



<400> io' 

aaagatgatt 
agggtatttt 
ggttataatt 
ttttttttta 
• gagaaatttt 
ttataatgtg 
ttatttttgt 



aaaagtttaa . 

tagtagtttt 

tttgtgtttt 

ttttgtttta 

aaggt'ttata 

ttgggtattt 

ggattttttt 



ttgtttattt 
tgtttatttt 
agtaaatttg 
ttattttgat 
ttatttaaat 
ttagatttgt 
tttatttttg 



gaagagttga 

gtttattcgg 

tgtatgttta 

gtaaaattat 

tttgttagat 

ttattaaaaa- 

agatgttttt 



tttttttatt 
ttttttttgt 
tttttttttt 
ttgttaattt 
ttttattttt 
gatgtaaaat 
tttggttgta 



tttgtaataa 
ggttgtgtaa 
tgttattatt 
tatttgaaat 
gttatatggt 
.aaaataatga 
ttattttttt 



60 
120 
180 
2.40 
300 
360 
420 



13 



attttttgtt tattgattag aggaggggtt ttaattatgg gtgaatttta tattttattg 480 

aagaggttat gttatatgta tatttttata atataattta tatttatata gtatttttat 540* 

ttttagtata ttttttttta ttaattttaa taatattatt gtaagttatg ttgaagtaga 600 

ttgtaagtgt ttatttataa attgtgaaat gaattaaaat gaaagggtaa agattaaatt 660 

* atgattaggt ttgaaattaa tatataagat ttaatttttt • ttaattaaag atttttgtag 720 

gtgatttttg tttgtaggat tttttttttt ttttagatgt tattggattg tattaggttt 780 

attgtagatt ttagtcgttg tagaattaat tagatttaag atgagttttt tgattttttt 840 

tggtagagtt ttttaattgt tgaattttaa tattgtcgtg attagttagt gttataattt 900 

gtttgtttta ttttgtgtaa tggattttat attatagagg tattttttta atgttaagat 960 

gtttaagtat tgtttaagtg taaattattt aatatttttt agttattaag taattaagat 1020 

aggtaggatt ttatttgttt taaaatgatt tgatttaaat taaaaagaga atgtggattt 1080 
tttgaatttt atttggttaa "ttttaatata atttttagta ttttataatt ttttttaaag ■ 1140 

tttttttatt tggttatttt ttgtattttt tttgtttttt tttttttttt ttagttataa 1200 

taattgttag attttgtttt attttttttt gatagttttt atttttaagg ttatttattt 12 60 

tttttaggta ttttttggtt ttagtttgag tatagtagat tttaagatta tatatgttat' 1320 

agtataggtt attatagtta attttttgaa taaatgtgat tgaattttat gttagtaatt 1380 

tttatttatt atttttttat taaaaaggtt taaagttttt atttaatgtt tttttttatg 1440 

tttattttgt taaatgattg ttttttaatg atattttaga attttagaat tattttatta 1500 

tggaggatgt gtaagattag ttttttatta aataaaaagt gtgaaatgga. atatgtaatt 15 60 

ttattaattt attttggttt taaaattttg tgattattag ataaaattta gaaataaaat 1620 

agtattatta atataaataa atttttatta taattatatt ttttaagttt tgtttgtaag 1680 

aatgggtaaa atatttttaa aattttgaag aaattattat ttgatagaaa gtttaattta 1740 

tttgtgagaa ggtaaatgta tttagatata attaaagttt' ttttttttat tttaatttta 1800 

tttattttga attaagattt tattgtttta tttttttaga tgttgttatt tgaataatat 18 60 

tgttttgaga ttaaaaatta gtatattaat ataatttttt ttaaacgttt taagagtttt ' 1920 

gtttttttta. tttttttttt taaaaataag tagttattaa attttttagt agtgaatttt 1980 

aaaatttttt ttaattttat aggtttaagg gtagttaagg atggttgtag ttttatatga 2040 
ttagttgtta aagtaagttg aggtattgaa . gatggagaat ttaaattttc gataagagtt. , 2100 

agaagataat tttaattatt ttataaaatt ggaaattgag gtatttaata tgaaggtatt 2160 

aagattgtga tttttaattg. tagtttattt atttttattt agtatttttt tttgtaaatt 2220 

tgaggtaaga tattttattt aaaagtgtat tttaaattaa gtaataatat gtaaattttt 2280 

ttttgtaaaa gttagtattt atatttttaa ataagatata ttgaatttat ttagtgaatt . 234 0 

atataaagaa aataagtgta aaattttaat ggttagttag. tttttagttt tttttaagat 2400 

taaagagaag agattaaata tagtattatt gtattgaggt aaggtttttt gtgtagttta 24 60 

tagaaattag 2470 

<210>' 11 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 11 

ttagttttta tgaattatat agaaaatttt gttttagtat agtgatgtta tatttggttt 60 

ttttttttta attttaaaaa gaattaagaa ttaattagtt attggagttt tatatttatt 120 

ttttttatat gatttattga atgaatttaa tatattttat ttaaaaatat aaatgttaat 180 

ttttgtaaga aagagtttat atattattgt ttaatttaaa atatattttt aagtaaagtg 24 0 

ttttatttta agtttataag agggaatatt gaataaaaat. ggataaatta taattaaaag 300 

ttatagtttt gatattttta tattagatgt tttagttttt agttttgtaa gatgattgga 360 

attatttttt agtttttgtc gaagatttga gttttttatt tttagtgttt taatttgttt 420. 

taataattga ttatatgaag ttgtagttat ttttggttat ttttggattt ataaggttaa 480 

aaaggatttt gaaatttatt- attaaaaaat ttagtggttg tttgttttta aagaaagggg 540 

taaaggaaat aaaattt.tta agacgtttaa gaagaattgt gttaatatgt tagtttttgg 600 

ttttaaaata atattgttta- agtagtagta tttaagagga tgaaatagtg gagttttagt 660' 

ttaagataaa tgaaattaaa atagaagaga gaattttagt tgtgtttgaa tatatttgtt 720 

tttttataga tggattaaat tttttattaa gtaataattt ttttaaggtt ttaaagatat 780 

tttatttatt tttataggta "aaatttagga aatataatta tgataaaaat ttatttatat 840 

tagtaatatt attttatttt tgaattttat ttgatagtta tagaatttta gagttagaat 900 

ggattaatga gattatatat tttattttat attttttatt tgataaaagg -ttaattttat 960 

atatttttta tggtgaaata gttttgaagt tttaagatgt tattaaaagg taattattta 1020 



ataaaatgga 
ggtaaataag 
agtttatgtt 
tatttaaaga 
ttggtagtta 
ggtaagagga 
aagatttagg 
aattttgttt 
atatttaaat 
taaggtagat 
gattttatta 
aatttatagt 
tagggattat 
gtttggttat 
atatttataa 
tatgttaaaa 
ataatttttt 
ggtaaggggt 
ataggagtga 
tatattataa 
aaaatttttt 
taagagaaaa 
agttataatt 
aaagtatttt 
aattattttt 



tatgaaggag 
aattattaat 
atggtatatg 
gaatggatga 
ttatgattgg 
ttttaggaag 
agatttatat 
attttaatta. 
attttgatat 
aggttgtaat 
aaggaaatta 
aaatttggta 
ttataaaagt 
gatttaattt 
tttgttttaa 
ataaaagtat 
tagtaagata 
gaagaagtaa 
ttattatttt 
attatatgat 
attttaaata 
ggtagtaata. 
ttatataatt 
ttattatagg 



agtattaaat 
ataaagttta 
tggttttggg 
ttttaggagt 
gaaagaagag 
aattatagaa 
ttttttttta 
tttaatagtt 
taaaaaaatg 
, attggttagt 
ggggatttat 
taatttaatg 
ttttggttga 
ttgttttttt 
tataatttat 
tatgtaaatg 
tagggtttat 
tgtagttaaa 
gttttatatt 
agaggtaggg 
aaattaatag 
gagaaaaaaa 
ataaaaagag 
aataaaaa'aa 



gaagatttta 
attatattta 
atttgttgtg 
agagattgtt 
gagagataaa 
tgttaggagt 
gtttaggtta 
aaaaagtatt 
tttttgtaat 
tacgataata 
tttagattta 
atatttgagg 
aaaaaattga 
attttaattt 
agtgatatta 
taagttatat 
ttatagttaa 
ggagatattt 
tttttaataa 
atttgataga 
gtaattttat 
tgggtatgta 
tcggatgggt 
ttaatttttt 



agtttttttg 
tttaaaaggt 
tttaaattga 
aaagagaaat 
gaagatataa 
tatattaaga 
aattattttg 
aagtagtttg 
atgaaattta 
ttggagttta 
gttagtttta 
aggaagggga 
gttttgtgtg 
attttataat 
ttaggattaa 
tatgaaaata 
gatttttttt 
taaaaataaa 
gtaggtttga 
atttgaataa 
attaaaataa 
taagtttatt 
aagatgagta 
agatgaataa 



ataggaagat 
tgattataat 
ggttaaaaga 
gaagtagagt 
aagatagtta 
ttaattaagt 
gaataaataa 
tatttaagta 
ttatataaaa 
gtaattggaa 
taacggttag 
gttttgtagg 
ttaattttag 
ttgtaaatga 
taaaaaaagg 
tatatgtaat 
ttgattaatg 
ggaaaaattt" 
aaatatttag 
tgtgaatttt 
taaaataaaa 
gagatataga 
gagattgtta 
ttaaattttt 



1080 

1140 

120O 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460' 

2470 



<210> 12 
<211>.7001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 12 

aatgtaatgg 
gtgtggaagg 
ttggagagat 
gaatggtgat 
ttggagatat 
tagtaagtta 
gagttgggag 
gtatattttt 
aagagaatgt 
ttttgattgt 
tttttatgtt 
tttttttttt 
tggttattgt 
tttaatattt 
gtgtttaaat 
ggtattatta 
tataaatttt 
gtgattagag 
gggtgttatg 
gttagtatta 
gttattggga 
attagtttaa 
tacgtatttt 
gggtttggtt 
ttaaaatatt 
gtttagagat 
tttattgtta 



aaaaagagag 
taagggaaaa 
agtgatgttg 
ttatatttga 
atttaacgga 
cggttggaac 
ttcgagtgaa 
ttatttatgt 
tggagagaaa 
aagtttttga 
aatagtgaat 
tttttttttg 
tgatttatag 
atttatattt 
attgtaatat 
atatttttat 
ttagtaattg 
ttatttaata 
tgttttggga 
tggtcgagag 
aatgagagat 
aataaaggta 
ttatatgttt 
aacgaagtgg 
tattttaaag 
tgggggttag 
aattatattt 



attgtaaagt 
ttagttttcg 
ggtatagata 
ataagtttgt 
ggagttagat 
gtgtgtgttt 
gtttttgtta 
ttattaagtt 
gtggttaaga 
ataattaaag 
aattgtagat 
aatatttttg 
gaggtattat 
agtatttatt 
tgggggtttt 
ggttttttat 
tatagtgttt 
tttaaggtgg 
ttttgagcgt 
aatagttgag 
ttcgttttta 
"tttattttta 
atagaaagta 
aggagtatta 
tttggttttt 
ggttagtaag 
tttcggaaaa 



tagaaggttt 
aagaagatag 
cggggaagtt 
aatgtttagt 
taatttttat 
agtaggagag 
aaggtagaag 
tagggataag 
aaattgtttt 
tttgtgagga 
tttttttttt 
tttttttttg 
tgttattaa't 
tttaatattg 
ttaaagtata 
tttttttttg 
tagggttaga 
tgattaatgt 
gggtatttta 
aaagtggtta 
attacggtta 
ttttatgttt 
gttaattgag 
tattttagtt 
tggtgtaatc 
gtatttgatt 
attatttatg 



aggaattgtt 

tgagatttta 

gagaggaata 

agatcgttgg 

ttttttttat 

ggtagggagg 

aggaaagtcg 

gtttattaag 

tattgaattt 

gatagttaat 

tttttttttt 

ggattggttt 

aaagggtaat 

attatggaga 

aaaatatata. 

tattggtttt 

gatcggttat 

ttggtaataa 

ggagtatttt 

agaggtggat 

gtgtaattcg 

atattttagg 

ttatttatgg 

ggaaatatat 

ggaggtatgg 

tatatgtatt 

ttttattttg 



ttttgattag 
atttgggtgg 
ttatgtttga 
aaaagtgggg 
.ttgagagagt 
gaagttaaga 
gcgtagtata 
atgagtttgg 
tttgggttaa 
ttttttattt 
ttttttttgt 
agagtatggg 
agtttttttt 
gagttttttt 
tttgtatgat 
aagagttatt 
ttttggtatt 
agtttttatt 
agtattgcgt 
* ttatgtgaac 
aaagtttaaa 
tttttaataa 
aaaggtttgt 
ttttagaatg. 
taatgttttt 
ttagaaggtt- 
taaatttgat. 



60 
120 
180 
240 
300 
360 
• 420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020- 
1080 
1140 
1200 
1260 
1320 
1380 
1440. 
1500 
1560 
1620. 



-£->-• • • 
• • • 





atttatatat 
tgtttttttt 
gtaggggtaa 
ttgttttttg. 
tgtgtagtag 
tgtttttcga 
ttatatattg 
tttttgattt 
aggtttgttt 
tggttagtgt 
tgtgtttttt 
taggtttata 
ttgaaaattg 
gtaatgattg 
ttatttgttg 
gtatattgtt 
aaatagttaa 
atttcgttag. 
gtattagagt 
attatagacg 
tatatatata 
aatttagtta 
tagtttaagt 
attttaaaaa 
ttcgtgattt 
gttttggttg 
gtttttttga 
ttaatataaa 
•attttgggag 
ggtaaaatgt 
. ttttttgtta 
" cgtatatttt 
tatgtatata 
aatagagata 
' tttttggatt 
gattttcgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgta 
gttggcggag 
cggattcgta 
gttgcgtcgt 
ttttttgttt 
ttttttatat 
agtttttgaa 
tggatagtag 
tcgcggtcgt 
ttgaggttgc 
tttcgagttc 
acggttagta 
tcggttcgtc 
gttcggtagg 
cgcgggacgc 
ggttttgcgt 
aaacgtattt 
ttaataaaat 



' tttaggtaaa 
aaagttgggg 
ttttgtttaa 
agttatgatg 
tttacgtttt 
ttggttgagt 

m taggtgtttc 



ttttgattgg 
tttgtttggg 
aggggttggg 
tgagtagata 
tttagttgcg 
atttttgtta 
agttattcgt 
ttataagttt 
tgattttttt 
ttgtatttta 
ttttttttta 
ttttttaaga 
aaagtgtgat 
ttggttataa 
ataatttata 
taataatggt 
aaatgtaatt. 
atattttgtt 
ttattaggtt 
tatgatatat 
tatatatata ' 
'aaaatttttt - 
.gaatcgagaa 
tttgttagtt 
gaggttatgt 
tataattaat 
attatttttt 
tatataagtt 
tattttgtag 
agtttttttt 
gtttttttag 
attttatttg 
ttacgtattt 
tatcggagtt 
cgtttttcgc 
cgttttcgtt 
aagtttaggg 
taagttaatg 
tatgagttcg 
ggcgttcgtt 
ggttttcggg 
ttttaatttc 
tgcggggata 
taaagtattt 
tcgttcgtag 
taagttcgtc 
cgttaacgcg 
ggcgttcggt 
gttgatgtta 
ggtgttttat 
"ggtattttat 
agggagggag 
gcgattcgag 
gtagttcgcg 
tttatttatt 
atacgttgga 
taaatacggg 
gtgtttggag 
tgtgtttttt 
atgatttttt 
ggggttttag 
tcgcgttgga 
gcgagtttaa 



tattttattt 
gtggtagtag 
gttttttttt 
gtaagttttt 
tgtttgtcgg 
ttttacgtat 
atatgcgagt 
atggaatatt 
ttttgtagtt 
gtagggagat 
ggatttgtag 
gttttatgaa 
ttatggaaat 
agggaaatat 
gttattaatg 
ttatttttgt 
. tttagtaata 
gggtgtttgg 
gtagtaaagg 
tttatttatt 
tatatgtgtg 
tttatttgtt' 
gatcgagttg 
ggattagatc 
ttggtatgaa 
ttgtggaagg 
atatgagaat 
aggttgagag 
aggaagaaat 
atatgtatat 
ttgttaaata 
ttttatttcg 
ttagttaacg 
tggtacgggg 
gtttatttta 
ttttggtcgt 
aagttgtttt 
ttagggtaag 
ggagattagt 
ttgggattgt 
gtagggtcgg 

gggttgtgtt 

cggtttgtat 
gggatggttt 
tttaagattt 
gtgtataatt 
taggtttacg 
tttaacggtt 
ttgtattcgt 
tatttggaga 
aggtattcgc 
ggagggaggg 
ggtgcgcgta 
ttgcgtttag- 
tatcgttcgt 
aaattagata 
gcgttttgag 
ttagtagagt 
aaattttttt 
gggaagcgat 
agagcgattg 
gcggattgtt 
aataagttat 



tagtcgtaag 
aaagtatagg 
tttagtatag 
ttttattttt 
gaggggttgt 
aaatatattt 
atattttttt 
tttggaaaga 
ttagtatttt 
gaggattgtt 
gattttttgt 
gtgttttttg 
taaattattt 
ttgcgatgta 
ttaaattcga 
aagaattaga 
gtaagtttgg 
gatagtaatt 
aagtttattt 
ttttgttttt 
tgtgtatgtg 
ttttatttgg 
taggattaaa 
gataatgtaa 
aaggttatat 
tatgaatatt 
ttttaatggg 
aattttagaa 
tgaggttttg 
tttgaatttt 
tagttgtttg 
.gttatagtgt 
aggaggggga 
tatataaggt 
agtttagttt 
gaaatttagt 
ttgggatcgt 
gtaatagttt 
atttaaagtt 
atttgttttc 
.ggttagagtt 
ttttttttag 
tttgttcgcg 
tattgtatta 
ttttggagcg 
atttcgaggg 
gttagatcgg 
tggggggtt't 
cgtcgtagt't 
acg'agtttag 
gttcgcgtcg 
agaagggaga 
gggagttcgg 
agttaagttt 
gcgagaggta 
aagtagtagt 
ttatttggga 
ttagtagagt 
ttatttagat 
ttttgttatt 
•ggagttgaat 
ggtatgtgat 
atggaagtat 



attatgattt 
gtatttttta 
ttttttttgg 
tattgttatt 
taagtgtttt 
atatattttt 
tttttttttt 
cgtttttgat 
gagaaagtaa 
gttttttatg 
gttatttgta 
tatgtgtttt 
gtaaaaaatt 
tttaatgtgt 
ttttggtttt 
taaaatgtat 
tatttagata 
aaaataaagt 
aaaagtataa 
ttaatatgta 
cgtgtgtatg 
atatttgatt 
ggatagatat 
tataattgtt 
tttatattta 
tatgtgcgtt 
attaagtata 
ggttgtggaa. 
gtaggttgta 
cgtttttttt 
tggttggttg 
agtttttttt 
attaaataga 
agtatattag 
tttttgggtt 
ttttatttag 
tttaaatcga 
ttggtcgttt 
ggaggttcgg 
gtcgggtcgt 
cgcgtgtcgg 
gtggttcgtc 
gttacggatt 
gatttaaggg 
gtttttgggc 
cgtcgtttac 
ttttttttac 
ttttttattt 
gtcgtttttt 
cggttatacg 
ttcgtcgggg 
gtttagggag 
ggcgcgcggt 
tttcgtcggg 
gattcgaaag 
tatttgtggg 
aggtttcgtt 
tttatttatt 
tatttgattg 
cgtttttttt 

gggtttgatt 

ttttgatagt 
aagtgtttaa 



atagtaagtt 
gtttttaagg 
ttgtgttata 
tatttagcgt 
gtttattggt 
tttgtttagt 
attttttcgg 
ttagtagggt 
tttatttttt 
ggggtatgtg 
tataatttgg 
aaaaaggtat 
gttttggaaa* 
ttttaatttt 
agttatattt 
atttgatata 
gattatgaat 
attgatagtt 
attatttaag 
tatatatata 
tttaattttt 
ttgtatattt 
gtagaaatgt 
aaagttttgg 
gttttttgaa 
ttaattaaag 
gtattgtggt 
gggtttattt 
tttttttgat 
tttttagatg 
cgtatgtaat 
agggttattt 
aagagagata 
agaaagtcgg 
atttttagta 
tagcgacgat 
gttgtgtttg 
tttagtattt 
gagtttagga 
tcggttttat 
•cgggatatgc 
ggtttttgag 
atgattatga 
aacgagttgg 
gaggtgtatt 
gagtttaacg 
ggtttcgggt 
aatagcgtgt 
ttgtagtttt 
gtgcgcgagg 
tggtcgtcgc 
ttgcgggagt 
ttagttcggg 
tagttgaaaa 
ttcgggtttt 
ggaaaatatt 
tttggtattt 
tttttaatgt 
gaaatatgtt 
ttttttttat 
tcggagttag 
cggaaatttg 
aaataatttt 



1680 
1740 
1800 ■ 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 . 
2760 
2820 
2880 
'2940 . 
3000 . 
3060 ■ 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 " 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
' 4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 




ttgttagttt 
ttagttaggg 
gagattattt 
ataagaattg 
tttgtttttt 
gtgaatattt 
agggcgttat 
ttggtgtttt 
tttttaggga 
attgaagggt 
gagtatttta 
aatttttaaa 
agagaaaaag 
gattaagttt 
gatgtgaatt 
ttagtttttt 
tttggtagat 
gggatcgtat 
tgttttattt 
agagataagg 
tttagtttta 
agaattttgt 
gatagatatt 
aaaggaaagg 
agttttgggg 
ttagtttagt 
aaatattaga 



agtgataagt 
tttgcgtttc 
tgtgatttta 
tttttttttt 
gttttttgtt 
ttttaggtgg 
atggttgggt 
tatatttttt 
tatttatagt 
agattttggg 
tagttacgta 
ggttggtata 
gatttttttt 
aattttatat 
ttataatttt 
tattgtaaat 
tatttattgt 
tttatttaaa 
tatatttttt 
aatttaaaaa 
gaatgatttt 
ttgttattat 
atttagtatt 
attgattggg 
ttgaaagtag 
gagtttagaa 
gtgtatggta 



ttgttttatt 
gtagttattg 
atggcgaagg 
tttttttttt 
tttttgatgg 
ttataggata 
tttgaattag 
tgttttgtaa 
agaaggaagg 
ggtatttagt 
gatttaagtt 
ttttaaatta 
gtttattttt 
aatatttaaa 
atgatggatt 
taatgttgtt 
tttgttaata 
attgtttatt 
gtattatatg 
.ttaaaaaaaa 
ttttgtgttt 
ttagtaagga 
tttttttttt 
tttttttagg 
tgtttttaaa 
tattttttaa 
tttcgtaagg 



cggggagaat 
tggaaggagc 
ttgtgtgttt 
ttttttattt 
gtttgtagag 
ggtgtatttc 
gtatttttta 
gtcgtggatt 
ggattaaagt 
gtgtattttt 
gtttttattt 
attttatttg 
ttttttttaa 
attgtttgat 
ttagaaatta 
gaattttaat 
attataaagt 
attttagtga 
atatttaaat 
aatttgtata 
tatttttcgg 
aaagaagtat 
tagttttttt 
aaattttaag 
ttgtttgtta 
aaggattaaa 
ataagttttg 



gtttcggagt 
gcggtcggtt 
ttattttaat 
ttttttgttt 
ggattaggtg* 
ggattgggtt 
attgtatatt 
agtttttgtt 
gtagtttggt 
agtcgttttg 
aaaagataaa 
ttttaattta 
atggaagaat 
gtgaaggaag 
tttttttttt 
gggtattaat 
gaatttggtt 
taagtggtat 
atttttagaa 
aatgggattt 
attatttttt 
.ttatgtaagt 
gtttaaatga 
ttttttaagt 
tgatttagag 
atggaaagga 



ggcgtgcggg 
taggatatag 
tttttttttt 
agtttttttt 
ggcgtttttg 
tggaagtttt 
ggtattcgga 
tagtattttg 
tttagaggat 
gagaaattta 
taatgaataa 
gggttaaaat 
aaagtatagc 
gtattg.gtat 
tatttaattt 
gagattgttt 
aaatatagag 
tagtgtaata 
taataaaaaa 
tgtgtggaaa 
tttttttgtt 
tttttatatg 
ttttgggtat 
agtttttaaa 
ggttatgaat 
atataataga 



<210> 13 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 13 




ataaaattta 
ttttagtttt 
atgataagta 
atttaaagtt 
tagaaaaatt 
gatgtttttt 
ttcgggaaat 
ttgtgtaagt 
tgtttgaata 
attattggga 
tattttgtaa 
tattgaaatt 
atggtttttg 
tattaagtag 
tttaaaaaaa 
ataagtaaaa 
s - tgagtaaaga 
tgaagatgta 
tattttagtt 
tggtttacgg* 
ttgggaaatg 
cggggtatat 
tttttgtagg 
* aaaatgggag 
aggatatata 



tttttacgaa 
tttaaaaaat 
gtttgaaaat 
ttttaaaaga 
gagaggggaa 
ttttttgttg 
aaaatatagg 
ttttttttta 
ttatgtaatg 
tgatgaataa 
ttattagtaa 
tagtagtatt 
" gagtttatta 
ttttaaatgt 

aaaaggtggg 
ttaatttgaa 
tagtttggat 
tattgaatat 
tttttttttt 
tttataaggt 
tttaatttag 
ttgttttgtg 
tttattagga 
ggagaggaga 
attttcgtta 



atattatgta 
attttagatt 
.attgttttta 
gtttaattaa 
aaatattaaa 
gataataata 
aaaaattatt 
atttttaagt 
tagaaagtgt 
ttttgaataa 
ggtagtaaat 
aatttgtaat 
taaggttatg 
tgtgtggaat 
tagaagaaat 
atatgttaat 
ttgcgtggtt 
ttttagaatt 
tgttataaat 
agaaagatat 
gatttagtta 
gttatttagg 
aaataaaaaa 
gggagaaaga 
ttgaagttat 



ttttgatatt 
tattaaattg 
gttttaaaat 
tttttttttt 
taatatttgt 
ggtagaattt 
ttaaaattga 
tttttgtttt 
aagatagggt 
gatgcgatt£ 
aatttattaa 
aaaagaattg 
gaatttatat 
tagatttaat 
tttttttttt 
ttttaaaagt 
gtgggatgtt 
tgttttttag 
atttttggaa 
agggatatta 
tgtggcgttt 
aaggtgttta 
taaaaaataa 
ataatttttg 
aaagtggttt 



ttttattata 
agtttatgat 
ttttgagaat 
tatatttaga 
ttatatgaag 
taataaaaga 
atttttatat 
tttttttatt 
atattatatt 
tttttgtatt 
ggagtagttt 
aaaattaaat 
tatattagtg 
cgttgtattt 
tgttttaatt 
tttgtttatt- 
ttaaattttt 
tattttttgg 
atagaatatt 
gttcggatat 
tgaagttttt 
ttagaagcgt 
aaggagaaat 
tagggaaaaa 
tttgtgtttt 



5460 

5520 

5580 

5640 

5700 

5760 

5820 • 

5880 

5940 

6000' 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

■ 6720 
6780 
6840 
6900 
6960 
7001 



ttttttttta 
tttttgggtt 
tatttaagaa ; . 
gttatttaag 
aatttgtata 
ggaaagataa 
agagttttat 
attttaagag 
gatattattt 
tgattaggtt 
tattagtgtt ■ 
agggaagaaa . 
ttttttttta 
tgttttttta 
ttaaattaaa 
gtttgttttt 
ttaaggcggt 
ggttaaattg 
gaataaaaat 
tagtgtatag 
aaatttagtt 
ttatttaatt 
tgggtaagag 
aattaaaatg 
"ggatcggtcg 



60 
120 
180.- 
240 
300 
360 
420 
• 480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
* 1380 
i440 
1500 
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• 99% t 



cgtttttttt 
tatttttttc 
tgtgttttta 
agttatatgt 
tatttaattt 
gggtgatagg 
agtttagatg 
aattttgttg 
tt'tttaagtg 
aattgttgtt 
ttgtttttcg 
agaatttgat- 
ttcgggtttt 
tttttttttt 
gcggcgcggg 
gttgggttcg 
' tagttgcggc 
gaaatttttt 
gtcggtttga 
gttttcgggg 
tcgttttagg 
ttgatgtagt 
tcgcgggtag 
tttggaaaaa 
gagttttggt 
cgggagtaag 
taattttaag 
gggattgttg 
agcgatttta 
ggttgagttt 
• aagattgggt 
.tgttttatgt 
tttttttttt 
tatattgtaa 
atagatagtt 
ggagatttag 
ttaggatttt 
tttttgagat 
ttttattagg 
gggtgtttat 
aatgtgattt 
gttatattgt 
' ttttagtttt 
tttagatgaa 
gtatatatat 
aggagataaa 
tgaataaatt 
taattgttat 
gttaagttta 
atttaatttt 
atcgagttta 
gtgta'tcgta 
tagatgattt 
gtaaaaagta 
tataaagaat 
. tagtaatttt' 
taaaatgtta 
gttttt'ttag 
. ggaaggaatg 
tggatgtgtt 
ggtagttttt 
ggggagtgag 
• gttgtattgg 



atagtggttg 
gggtgggata 
tatggtttgt 
tagtaattcg 
ttaatcgttt 
aatcgttttt 
aaaggaaatt 
agttttgtta 
atttaaagcg 
ttgtttggtt 
tacggacggt 
tttgaacgta 
ttgcgcgtat 
tttttttttt 
cgcgggtatt 
ttttttaggt 
ggcgggtgta 
aggtcgttgg 
tcgtagattt 
tagttgtata 
gggattttga 
agggttattt- 
ggtgtagatc 
gagtatagtt 
ttcggttttg 
tgtagtttta 
tattggtttt 
ttttgttttg 
aagagtagtt 
tacg'gttagg 
ttaaaataaa 
gtttcgtgtt 
tcgttggtta 
tcgagatagg 
atatttagta 
ggtatatata 
agtttttttt 
ttttttagtt 
aatttttatg 
gttttttata 
ttttatatta 
cggtttggtt 
ataattcgat 
aaataaatag 
atatatatat 
aaataggtga 
ttttttgttg 
tttaaatatt 
ttgttattaa 
tataaaaata 
gtattaatga 
gatatttttt 
aatttttata 
ttttataggg 
tttataagtt 
tatttttttg 
aagttataga 
aaatgtttta 
tgttcgtatg 
tgtgcgtgga 
ttcggtagat 
gggagatttg 
ggaggagaaa 



cgaggcgtag 
ggtttgttat 
tttaaattcg 
ttttagcgcg 
tttaaagttt 
tagaaagtta 
tggggagtat 
attttaaata 
tttcgtgttt 
ttttaacgtg 
aagtgggtgg 
gcgcgggttg 
tttcgggtcg 
tttttttttt 
tgtagaatgt 
agtagggtat 
gtagtattag 
agtcgaacgt 
gcgcgttggc 
cggcgggttt 
gttgcggacg 
tagatgtttt 
gtgttttcgt* 
cgaggttaga 
tttcgggagt 
ggacgaacgt 
tcgagtttat 
atattggttt 
tttttgaatt 
gggcgggggc 
cgcgaaagac 
agatttcgat 
gaaatacgta 
gtagatagaa 
gttgggggaa 
taggaagagt 
tttgtaaaat- 
tgatttgtgt 
tgaaggatga 
ggttggttat 
aatataattt 
tagttaatag 
tttttcggtt 
aaaaaagttt 
atgtgtatat 
agtatattat 
tagtttaata 
taatagagta 
gagttatatt 
gattattgtt 
ttatagattg 
tttatggtta 
aattatattt 
tttttaaaaa 
ttgaaagaaa 
ttaggatata 
gagagaaatt 
tgggtttgta 
tgcgagtggt 
atggtaggga 
acgtagttgg 
ttgtttgttt 
ttttagtttt 



attttggttg 
tgggttggta- 
cgggatattt 
gatttagtta 
taggacgtgg 
ttattatagt 
attaaataaa 
tttttaattt 
atttgtttgg 
tgttttgtta 
agagtacgtt 
tacgtagaat 
cgcgtttcgc" 
ttttgtcggg 
cggcgggtcg 
ttgttggtcg 
cgggttcgga 
cgtagtttta 
ggcggtcgcg 
gttgttgttt 
gtttaggggt 
ggtgtggagg 
agggtagaag 
ggcgacgtag 
ttgcgggttc 
ttttcgttag 
atgtattata 
aaatattatt 
ttattttatt 
gtacgaggat 
ggatttaggg 
atatttttgt 
gtgtgtatat 
tggggtgtgc 
ttgatagggg 
tgtattttgt 
gtttttaaag 
gtttatgttg 
tttagaaaaa 
gtaattaaaa 
taggttacga 
atttttaaaa . 
tatttgggtt 
ttaattgaat 
atatatatat 
gcgtttataa 
gattttgata 
tttgacgaag 
tttgattatt 
ggataatatg 
ttagtaaata 
gtaattatta 
ttaattttta 
atgtgaattt 
aaggagatat 
gatattagtt 
aaaataagtt 
gaagttaagg 
ttagtgtgtg 
ttcgggaagt 
gttattgtat 
atagggagta 
tttgttttta 



attcgtacgt 
ggagattatt 
ataaattttc 
gttaatttcg 
ggtggggagg 
tgatata£tt 
aatattaaaa 
taaatgttaa 
aggtgttttt 
ggaagttcgg 
ttttttagtt 
tttcgggttg 
ggttttcgta 
cgcggcggtt 
gtttcgcgta 
tggggttgta 
gatacgttgt 
gattcggggt 
gcgttgaatt 
aggtatattt 
tttagttcgt 
gttatggtta 
gtttagaaat 
cgtatgtttc 
ggtgaagtcg 
tttttgggtt 
aaggtgttgg 
ttaggtataa 
tgtcgtcgtt 
ttgttaaagg 
gtcggttttt 
ttgttttttt 
aggatgattt 
ggttgtatac 
gtatttgagg 
tattaggaga 
tagatagatt 
gattatagta 
tttttggtta 
ttttagaaaa 
attaaagttt 
tgtatttttg 
aggatatgta 
taaaagttaa 
atatatatat 
ttttggatag 
taattattaa 
tgtttatggt 
ttatattaag 
taaatgtagt 
aagggttaaa 
ttttttaaag 
aatgtttttt 
gttaaattat 
atatatattt 
agaaaggtaa 
tattttgttg 
gtcgagagag 
aattaggtag 
agttagtagg 
agcgttggat 
gtgtggtata 
tttttggagg 



tatttcgggg 
tttaagtatt 
ggttgttaga 
aaattagatt 
aggggaaagc 
ttaattaaat 
ggataaataa 
gagcgagatt 
ttttataaat 
gttttcgggt 
gttcggcgag 
ggtcgcgcgt 
gttttttagg 
atttcgacgg 
tcgtgtagtc 
ggaaaggcga • 
tgagtggggg 
cgtaggggag 
cgtaggcggc 
cgtttagggg 
ttttttggat 
tggttcgtgg 
cggcgggtta 
gtcgatacgc 
ggcgattcga 
ttcgggtttt 
aggacggtta 
ttcgatttgg 
gttggataga 
tggtttaggg 
tttaatgtgt 
ttttgtttga 
tggggaggat 
gtagttagtt 
ggaagggggc 
atgtaa'tttg 
tttttataat 
ttgtatttgg 
gggcgtatat 
ttgaatataa 
tggtaattat 
tatgtttatt 
gaattaaata 
atatgtatac 
atatatatta 
tttatatttt 
tattttgttt 
ttatttaaat 
tatatatttt 
tgaagttaaa 
aatatattag 
■taatttttta 
taaaatatat 
atgtaaatgg 
ttatggagaa 
gttgtttttt 
gattaagaac 
tgagaaggaa 
agagagtgtg 
tagggtattt 
gaatggtagt 
gttagagaaa 
ttggaaagta 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040. 
2100 
2160 
2220 
2280 
2340 
2400 . 
2460 
2520 
2580 
2640 
2700 
•2760 
2820 •• 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
'3480 
3540 
3600 
3660 ' 
3720 
3780 
3840 
. 3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800' 
4860 
4920 . 
* 4980 
5040 
5100 
.5160 
5220 
5280 



•_•__••»• 

• • • 

ttttatgttt tttgttgtta ttttaagtaa gaggaaaaat aggtttgttg tgaattatag 5340 

ttttacggtt aaaatagaat gttagttaaa agtgtatgga tattaagttt ataaaatagg 5400 

'atatgggtgg ttttttcgaa agaatataat ttaataataa aagttttttg ggatatatgt 54 60 

ggattaaatg ttttattggt tttagttttt agtttttgaa tagaggtatt gttatgtttt 5520 

cgattgtatt aggaaattag attttggaat aaatgttttg gtattttagg gatgtgtttt 5580 

tagttgaaat gtaatatttt tttatttcgt taattaaatt tataaatttt tttatgaata 5640 

gtttagttga ttgttttttg taaatatgtg aaaaatacgt attattaaaa gtttaggata 5700 

tgaatataag ataaaggtag atatttttgt tttaaattga ttttaggttt tcgagttgta 57 60 

ttgatcgtga ttgggaacga ggttttttat tttttagtgg cgtttatatg gatttatttt 5820 

ttgatjbattt ttttaattat tttttcggtt atagtattaa tacgtaatat tgaggtgttt 5880 

ttagagtgtt tacgtttagg gttttaggat atatgatatt taatggaggt tttgttgtta . 5940 

gatattagtt attattttgg atattaaatg attttaatta taatgttagg agtggtcggt 6000 

ttttggtttt gggatattat gtagttattg agagatttat gagtggtttt tgagattagt 6060 

ataaaaaaga aatagaaagt tataaaaatg ttaatgatgt tattatgtaa atatatgttt 6120 

"ttgtgttttg aaagattttt agtattgtag tgtttgagta taggagagtt ttttttatag 6180 

. ttagtattga aaataaatat tggatataaa taaatattga aaagaaagat- tgttattttt 6240 

tgttggtgat agtggtgttt tttgtaggtt aataatggtt atttatgttt tagattagtt 6300 

ttagaaaaaa gtaagagtat ttagggaggg aggagagagg aataggggaa aggagaagga 6360 

aaggaaaggg gatttgtaat tgtttattat tgatatagga agaataagaa ggttagttgt 6420 

ttttttatag gttttgattg tttagagatt tataattaaa gttagtttaa gaagtttagt 6480 

aaaggtagtt tttttaatta tttttttttt agtatttttt tttaaattta -ttttggtgag - 6540 

ttttgttttt gggtttggtg agtatgggtg ggaaagtata ttgtgttacg tcgatttttt 6600 

ttttttgttt ttggtaaaaa ttttattcgg gtttttagtt tttttggtt't ttttttttat 6660 

ttttttttgt tggatatata cgttttagtc gtgatttatt ggttttttta ggtgaagaag " 6720 

ggtaaagatt gatttggttt tttcgttgaa tgtgttttta gttttatttt ' tttagcggtt 6780. 

tgttgggtat tgtaggtttg tttaaatatg agttattatt tttaaatatg gtgttttttt 684 0 

taattttttc gtgtttgtgt ttagtattat tgttttttta gttatttaga ttaaaatttt 6900 

attgtttttt tcgagggttg attttttttt gttttttata tttaattaag aggtaatttt 6960 

taagtttttt agttttataa tttttttttt tttattgtat t 7001 

<210> 14 
.<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 14 

ttaagttaga tgttttttaa ttatttgtgg ataggttagg tatattttga gtttaatttt 60 

attttatagg tttaatattt tggagttaga aagtttttag gtaaaaagtt tgaagggggt 120 

ttttttatgt tattagatgg atttttgtat ttttagaaga ttttttatat taggaaagat 180 

taaagtatta aggtaatttt ttttggtttt ttgggataat tttaggtttt ggtatgagtg 240 

gtttggaagt. ttttgtttta gttataatgt ttatatattt ttggaattgt tttgtagggt . 300 

ttgtttttta" gtataatttt ttttttaagt tttattgtag ttatagttta ttagttttgt . 360 

ttagtgataa ttaagaaatt aagaattatg tatttacgtt tattttttta gagttatttt 420 

tttttaggat aaagtttagg gttttgttat tgggttttgt taggagtcgt agtcgtaggg 480 

gttgtttatt atttaatagt tttcgtaagt atattgtgaa ggggaagtaa tgattagaga 540 

tagggttagt tgtttagttt ttgtatgttt aggtgtatgc gtatatattt* ttatataggg ■ 600 

tagggtgggg- tgggaagttt attttggtcg tgacgtttag cgcgtttaaa gagtgtaaat ' 660 

ttgcgggggt tatttattat taagaatttt cgtagtaggg ttttaatgat tttacgtgtt 720 

tttgtgcgtg atagtatatg taggtgtttg ata.gtatt.tt tgggtaggta aaaggaagtg - 780 

cggcgtttga tatttgtgtt tcgttttggg tttcgtcggg tattttgtaa ggagggtggt 840 

tttttgttgg agggtataga gatagggcgt . attagtttta gttgaatttt gatgaagttt 900. 

gtgtaagaat cgtttttgtt ttaaagaaat agagaaatta- aattttgata . ataggtttta 960 

ggtgagatgt tagtttattt ggggttaggt tgggtatgta taaattattg tfctgcgttta 1020 

tttaagataa ttttagttgt gattttttga gtattaggta tatagttggg tatttgtttt 1080 

tttttacgtt ttttttttga gtagttaatt tattaagttf atgaagaggt tgttgttgat 1140 

ttgggtattg tattttttga ttttttgttt aattttagtt tgagaaaggt taggtgtttt 1200 

ttattttata ggttcgtttt "gtaagatggg ttagtatgga tatagggttt ttgaggaatt 1260 

tagggttttt ttgaaaaatg gtttttgggg tagtttttgg aaattgattg tttttggttt 1320 

tttgtttttg atgtatatat atatagttgg tgtttatttt gaatttatta ttgtttttgg ■ 1380 



ttttgtatgt 
gtttagggtt 
ggtatatggt 
taaaattagg 
taggaggatt 
atttttatta 
aattgtttag 
agttattatc 
:ataaaaaata 
ggtcgaggcg 
aaatttcgtt 
ttagttattt 
tgaattgaga 
aaaaaaaaaa 
tgagttggag 
cggggttatg 
ttattagtgg 
aattattgtg 
ggttttaatt 
ttgtttggag 
tttaggttgt 
agcgattttt 
attcggttaa 
tttgaatttt 
ggcgtgaatt 
tttagggatt 
atcgattttt 
gttagatagg 
aatattttgt 
taggagagtt 
tgatggggat 
.tttttattta 
gttagggttt 
ttttaagtat 
aattttattg 
tggaaaatga 
cgggaagaag 
agtgttaggg 
gttaatttgt 
tttttttttt 
tttttttttt 
tattgtatag 
tgtttgttgg 
tttttttttt- 
gttttttttt 
ttggtttatt 
tagttgggat 
acggggtttt 
ttcggttttt 
tgtttttttt 
tttttttttt 
tttagtttcg 
ttttttagta 
ttagtataga 
gatttgtttg 
tttttttttt 
acgattttgg 
ttgggattat 
ggttttatta 
tggtttttta 
ttttgagata 
attgtaattt 
ggattatagg 



tttgggtgga 
ttagtattga 
taggggttag 
ttgggtgtgg 
atttgatttt 
aaaaaagaaa 
gaggttgagg 
gtgttattgt 
aaaataggtt 
ggtggattat 
tttattaaaa 
aggaggttga 
tcgtgttatc 
aattagtaaa 
tgagggtttg 
atttattgta 
atagagtagt 
atatgatgtt 
tgttagaggt 
atttggttta 
agtgtagtgg 
ttgttttaga 
tttttgtatt 
tgattttaag 
atcgtattcg 
ggatttttgt. 
tttttttaga 
ttttagttag 
tttagaagaa 
agggtttttt 
aaatggagag 
gaattcgttg 
tataggaaaa 
ttaaaggttt 
gggatagtgg 
gggttggagg 
tatttagatg 
taagaaggaa 
ttttgtgatg 
tggatgattt 
tggtgatgaa 
tttttttagg 
aagaggattt 
gtattatagg 
tgagatagag 
gtaatttttg 
tataggtgtt 
attattttgg 
taaagtgttg 
ttttttttta 



ttttttttga 
atttattgta 
gttgggatta 
cggggttttg 
ttttggtttt 
tttttttgag 
tttattgtaa' 
agttatatat 
tgttggttag 
aagtattggt 
gagttttatt 
ttgtttttta 
tgtgtatttt 



taagggaaag 
atgtagtttt 
gaatagttga 
tggtttatat 
taggagtttg 
aaaaaaaatt 
tgggaggatc 
attttagttt 
gggtacgttg 
ttgaggttag 
atataaaatt 
ggtaggagaa 
gtattttagt 
attatatttt 
ttttggagaa 
gggtgagagg 
atttggagtt 
taattttgat 
atataggtag 
aagatttttt 
* ttggattttg 
ttttcgagta 
tttagtagag 
tgattcgttt 
atttagagat 
. cggaagggtg 
tttttttggt 
aatgcgagtg 
tttttataag 
ttgatttcgt 
gatagatgag 
ttattttgta 
aggtaaaggg 
tttggatgga 
gtaattttgt 
ttgtgaataa 
ggaggagtat 
gattattttg 
gttttgagtt 
aggagttttg 
tttttttatt 
attttaaagg 
taatagttat 
gatttatttt 
tttcgttttg 
ttttttgggt 
tgttattata 
ftaggttggt 
ggattaaagg 
atatataatt 
gatagagttt 
atttttgttt 
taggtatata 
ttatgttggt 
ttaaaatgtt 
atagagtttt 
' tttttgtttt 
tattatgttc' 
gttgatttcg 
attagaggt'g 
ttgttattta 
ggtttaagtg 
cgtggttagt 



atagaattat 
aaggatatta 
gggattttga 
ttgtaatttt 
agattagttt 
agttaggtgt 
gtttgagttc 
ggggaattga 
gtttacgttt 
gagtttgaga 
agttaggcgt 
ttgtttaaat 
ttgggtaata 
aattgtatat 
ggtagtttat 
agtggagagt 
agttttttat 
ttaagttgta 
ttttttggtg 
ttttttttga 
gtttattgta 
gttgggatta 
atgggatttt 
gtttttattt 
ttttaattcg 
gagtgtggga 
tttattgtat 
ttatagatat 
gttttttgta 
tttggagttt 
ggagtagggt 
gttagtaagg 
ggaggggtgg 
gaagagtatt 
cgggttagta 
agtagtggat 
taggggtagg 
tttgtagaat 
tttatttaat 
ggttgggatg 
tttatttaga 
aggaagagtt 
tttagtcgga 
ttattttatt 
ttttttaggt 
ttaagtaatt 
'attggttaat 
tttgaatttt 
tgtgagttat 
tgggattttt 
tgttttttta 
tttgggttta 
tt^ttatggt 
ttggtaggtt 
gagattatag 
attttgttat 
tcggttgaag 
ggttaatttt 
aatttttgat 
tgagttatcg 
ggttggagtg 
atttttttgt 
tttttttttt 



ttggtttttt 
tagaagtagg 
agagggattt 
agtattttgg 
gggtaatata 
ggtggtgtgt 
gggagattgt 
gtgagatttt 
gtaattttag 
ttagtttgat 
ggtggtatat 
tcgggaggtg 
agagcgaaat 
tttgattata 
tttttttttt 
ggtgtatatt 
gttttatata 
taaaaggtag" 
ggtttttgta 
gacgaagttt 
agttttgttt 
. taggcgcgtg 
attatattgg 
tttaaagtgt 
attatttatt 
tagggtagtt 
tagttttatt 
agttaagttt 
gaatgatttt 
ttttaagtat 
ggagcgtttt 
atgtggggtt 
gaatttaagt 
ggagtaaaaa 
aaaataaatg 
gtgtttgttt 
agaaatgtta 
agggagggta 
aatgagaaag 
tagtgatttt 
aaatagattt 
tttttttttg 
tgtatagtag 
attgtttata 
tggagtgtag 
ttttgtttta 
tttttgtatt 
tgattttatg 
tgtatttggt' 
tagtttttta 
tttaggttgg 
agtgatattt 
tagataattt 
tcgaattttt 
gtatgagtta 
ttaggttgga 
tgatttagtt' 
tgtatgttta 
tgtaaatgat 
tatttggttt 
tagtggtacg 
tttatttttt 
aattggttag 



tttgttgttt 
ggtaattgaa 
ttatttaaag 
gaggttaagg 
gtaagatttt 
ttgtagtttt 
agttatagta 
gttttaaaat 
tattttggga 
taatatggag * 
gtttgtaatt 
gaggttgtag 
tcggttttaa 
gtattttagt 
tgtttcggta 
agtagtttag 
tagtgagaaa 
ttttaggtta 
tttgtttgtg 
tattttgtcg 
tttgggttta 
ttataataat 
ttaggttggt 
tgggattata 
ttttatttta 
agggttttga 
ttttgttgac 
agcgttgatt 
gtgtttagtt 
ttaaattatt 
agtagaatgt 
aagaattaag 
ttattttttt 
ttttagtata 
gtgtgggttt 
agtatattaa 
gatagatttt 
tagggatggt 
tttgtttttt 
atttttagtt 
ggattagagg. 
tttttaaagt ' 
gattatggaa 
aaaattgatg' 
tggtgcgatt 
gttttttaag 
tttagtcgag 
atttatttat 
ttaaaattga 
tttttttttt 
aatgtagtgg 
ttgttttagt 
ttttgtattt 
ggttttaagt 
ttaagtttag 
gtgtagtggt 
ttttaagtag 
gtagagatag 
ttatttgttt 
ttttttttat 
attttggttt' 
taagtagttg 
tgttttgtgg . 



1440 
1500 ' 
1560 
1620- . 
1680 
1740 
1800 
1860 
1920' 
1980 
2040 . 
2100- 
2160 • 
2220 
2280 
2340. 
2400 
2460 
2520 
258Q 
2640 
2700 
2760 
2820 
2880 
2940- 
3000 
3060 ; 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 " 
3600 
3660 
3720 
3780 
. 3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040' 
5100 
5160 



tttttttatt 
ggggtttata 
agtattattt 
tatttttttt 
agtttttgtt 
atggtatttg 
atagttttgg 
gttttttttt 
ttttttattt 
ttatatttta 
tatataggag 
tattttttaa 
gagggagtag 
tttagttata 
tcgtgtttaa 
tggaatcggg 
ttacgtagtt 
ggcggagttt 
tttcggcgcg 
ttcgagtttt 
gaaggcgttt 
agtcgttaat 
ttcgggttag 
tttgttttat 
attttatcgt 
gttagcgttc 
cgggaagtgc 
ggagtatttg 
ttttgtgtcg 
ttcgagaatg 
gcgtttatta 
taatagttta 
gagaggtttg 
ggaagtttta 
ggttatttta 
cggaaggtta 
tttttttttt 
ttatttaggt 
ttaagcgatt 
gtttagttaa 
tttaaatttt 
ggtttgagtt 
tatttgaagg 
ttattttttt 
ttttttattt 
tttagcgatt 
tagtattatt 
tttatttttt 
acgatgaagt 
agatttacgg 
tattaggatg 
tagttttatt 
tagttattgg 
taaatatgtt 
cgttgtaatg* 
attttgtcgt 
ttgtgtttaa 
tattacgttt 
taggttggtt 
gggattatag 
attttgtgtt 
•.atgtatatat 
. . atgtttaatt 



tttttatagt 
tttaatttaa 
attttgtaag 
atttaaataa 
tttttttttt 
gtatatagtt 
gttagtaatt 
tttttgtggg 
atttttaaat 
gtagttggtg 
tgtagatgga 
tttttggcga 
gagtattcga 
gttggataat 
tttgtttgtg 
gttttacgta 
aggagggtgg 
tagcgatcgt 
gtttgcgagt 
tttggttgta 
tcgtttcgtt 
tatcgtttcg 
agtcgc.gtcg 
•tggggtagga 
ttcggagttt 
gggtaagcgt 
-.gcgtgcgcgg 
cggagtcggt 
tcgggaaatc 
tttgtattgt 
cgttcgtgtt 
ggtagacgag 
gcggtgaggc 
gggagagggt 
tattgttacg 
taggtagaga 
ttttattggt 
tggagtgtaa 
tttttgtttt 
tttttgtatt 
tgattttagg 
atcgtattcg 
aaaaattaaa 
tagattaaga 
attttagtta 
tgtgtattat 
tgaagtttgt 
gtagtttttg 
ttgggtttag 
ttgaaatatt 
tttaaaagac 
gttatatttg 
agataaaagt 
taaattattt 
ttttagtttt 
taaggttgga 
gtaatttttt 
ggtttatttt 
ttgaattttt 
gtatgagtta 
ttttttagtt 
agatttggga 
tagattttat 



ggaaaatggt 
tttttgtatt 
gaattttgta 
ggattttcgt 
ggtgtttagg 
tttcgataat 
tggatttttt 
taaagtgttt 
ttttttttag 
ataggttaga 
ggagtaggat 
ggattttttt 
acgcggaaat 
gttcggttta 
atagaaatta 
cggtttcgcg 
ggttgtttat 
tttttttttc 
tagcgaggtt 
gtagttattg 
tatagtcgta 
gtcgcgtgcg 
taatcgttaa* 
acgtcggggc 
tatttataga 
ataagagtgg 
agtttgggtt 
aatttaggtt 
ggtaattaga 
ggtttgttta 
tcgttttatc 
cgcgcggtag 
ggaggagttt 
cgttgttcgt 
cggattttaa 
ggtttgttta 
aattaaaaaa 



ttaggattga 
taagttttgg 
aaattttatt 
tttatttttt 
tttggttttt 
ggggatattt 
tttttatttt 
ttgaatgttt 
atttatttta 



tggcgcgatt 
agttttttga 
tttagtagag 
tgatttattc 
gttttttatt 
tatttttaat 
tattggtttt 
ggtttgtttt 
taaatggaat 
tttttttgat 
tttagttttt 
ttttgattat 
gatttaaatt 
gttttaagtg 
tttagttttt 
ttaaatttat 
tgttttttat 
gttttttttt 
gggtagtggt 
tattttagtt 
tttttatttt 
gattttaggt 
tcgcgttcgg 
gaatttgtga 
tattatttat 
gttataataa 



atagggaaga 
ttttttttgt 
ttttttatat 
cgaggtgtta 
ggtttattat 
agggggtttc 
ggtaggtttt 
cgatttagga 
gttcgtcggt 
cgcgcggtga 
ttcgtcgtgt 
ttcgttcgtt 
tgcgtgtacg 
gattgaaacg 
ggggatacgt 
ttttatttat 
tttttggtcg 
agtttgggtt 
ttttttttag 
acgttttttg 
ttttcgtttt 
gcgggttttt 
cggtagtggt 
taggtcgggg 
taggtttcgt 
tgttggttat 
atagttggat 
ataataataa 
tcggtttatc 
gtatttagga 
acggggtttt 
gtttcggttt 
gggaacgtat 
ttacgtttgt 
tatatatttt 
tttttttagg 
ttagtgtttt 
ttttaggggt 
ttgtagattt 
tggtttggtt 
tttagattag 
aatatggtta 
tttttgtagt 
ttttgatttt 
ttttatggtt 
tttttttttt 
acgatttcgg 
tttcgagtag. 
ttagtagata 
gatttattcg. 
ttttttgttt 
tgtttttttg 
ttataagttt 
tttggggagt 



ttgatatgaa 
gttaagattt 
tgaagtatta 
ttaggtatat 
tgagtttggt 
aggaggtttc 
tttaggtatt 
atggtttaaa 
gaggaaggga 
ggtgagggtt 
tttttaagtt 
ttttttttag 
gtttaaattg 
aagttatata 
ggtatttagt 
cggttaggat 
cgcggtaacg 
attttttggt 
agtattgttc 
tgttttaggt 
ttttagtttt 
cgtgttagtg 
tagatcgtcg 
acgtttcgtt 
tatagggaac 
gaggcgaggg 
cgggttcgtt 
ttttcgcgta 
cgcgcggtat 
ttttatacgt 
agtttaggtt 
aggtgaattt 
aaatgtttcg 
aggttcgatt 
ttgggcgttt 
ttttatcgtt 
aaaattgcgt 
gtaattttcg 
ttataggcgt 
attatgttag 
tttaaagtgt 
atggaatata 
tttgtggttg 
aatttttcgt 
aaattgttcg 
attttttatt 
tatatttttt 
tgatttaatt 
tgtttatatt 
attttt.cgtg 
aaatttaatt 
aaaaatggtt 
ttttttgttt 
attttatttt 
tttttttgag 
tttattgtaa 
ttgggattat 
tgaggtttta 
ttttcgtttt 
attttttgta 
tatcggatga 
ttaaataggt 
ttttaaaatt 



gataagttta 
tggcgtgttg 
tttataattt 
tgagttttat 
ggttatatta 
gagatatttt 
ttataattta 
ataagatttt 
atagaatttt 
tagttggttt 
tttttaaata 
tttttttaag 
ttcggtcggt 
gttgtttttt 
atttaggcgg 
tcgcggggag 
gatcggggag 
ttttatttgg ■ 
gagtttcgag 
tatttcgaaa ■ 
gcgcgttcgt 
tgcgcgtgcg 
ggatttagtt 
tttaggaatg 
gggggcgggt 
cgggaaggtg 
tgtagcgggt 
gaattagttt 
ttaggtagtt 
tttcggtttc 
tcggggttcg 
gtaatttgta 
gagattgaag 
tattttagtg 
ggaaatcggt 
tagtatagaa 
tttgtttttg 
ttttttgggt 
tcgttattat 
ttaggttggt 
tgggattata 
tttgtttatt 
ttatttgttt 
' ttttattttt 
ggatagggtt 
tttatttttt 
tagttttttt 
tttatatttt 
tatttgtttt 
ttagtatttt 
tttttttttt. 
attaggtttt 
ttatttttga 
tttttgagaa 
atagagtttt 
ttttcgtttt 
aggtattcgt"* 
ttatgttggt 
tcgaagtgtt 
ttttgttata 
gagggttttt 
tagagtagtg 
tattgatgtt 



5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 . 
6360 
6420. 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
'6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
• 7980 
8040 
8100 
8160 
8220 
8280 
8340. 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
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■ tagggtttat 
ggattgtaaa 
gtttgttggc 
ttagtatttg 
tttgtttgta 
taattttttt 
ttgggaagat 
ttttaaagat 
atttgtggtt 
agcgatgggg 
tcgcgatttg 
gcgcggggtt 
tcgttataga 
cggttaggga 
ttggtacgtt 
" agtttaatga 
gggcgttagt 
tttaggattt 
gtgtagggat 
tttgtttttt 
cggcgatcgt 
tagaggaaga 
atatttggtt 
gagggagatc 
tcggtagagg 
gtatgggtgg 
taggatataa 
tgggtattga 
ttttaggata 
aagtttttgt 
ttagtttgta 
ggtgggggta 
ttataggttt 
ttggagttgg 
ttagattttt 



ttttagtagt 
agtatttttg 
gatttttagg 
atttattggg 
gggggtttat 
ttattttaaa 
ggatttcgag 
aagaaattta 
tagatacgag 
cgaaagtaac- 
attcgcggcg 
attatttacg 
ggtcgtatta 
ttagttgtcg 
ttagtcgggt 
gtttaggttt 
aagcgcgggt 
agattgggcg 
tgtggggttc 
tttttttttt. 
aaagtattac 
gggtttttat 
tatttattgg 
ggttagtagt 
ttggatcgtg 
tcggttttta 
gtttttttat 
ggatggtaat 
tttttatttg 
attgtagtaa 
ggttttagtt 
ttttatttat 
tattttcgcg 
tgtttggtaa 
gggatttagg 



tgatttaata 
gtgattttag 
ggtgtagaag 
ttagaaaagg 
taaaggtttt 
ggttttggag 
gatagtaggt 
aaattagcgt 
tggagtgcga 
ggatttagtt 
attgcgttgt 
cgcgtattgt 
cgtgtgcgtg 
tgtggggttg 
gcggtttttt 
tttcgatatg 
cgggcggggt 
gcgggaagga 
gaaggcgggg 
tcgttttttt 
gcggagatat 
attcggtttt 
gtggggtagg 
tttatcgttc 
attttgaggt 
gtaattgtag 
gtttttttta 
gtatgtgggg 
tagtaatatt 
gtatagcggg 
tatttggggg 
atttatgttt 
tttgtagtcg 
ttgtttgtta 



ggtttgcggt 

ttggtgttta 

gattgttggg 

gtgtttgtta 

aaattttttt 

ggggttatga 

ataaattttt 

atttttatat 

taagggataa 

ttcgggagtt 

tttttggttg 

aggtttttgc 

gcgggtttcg 

tacgcggtgt 

ttagcgcgtt 

gttcggttgg 

tatagggcgg 

gttgaggaga 

ttgggcgcgt 

tttatatttt 

tcgtgttttt. 

ggtttttttg 

aagtttcgag 

gttttgtttt 

ttaggggtgt 

tttttatttg 

gtgtttgatt 

gatgtgggag 

atttattttg 

g'tcgttttgg 

aaagttagga 

tttgtatttt 

taggttttgf 

tttgtttttg 



gggatttagg 

tttaggggag 

tgtgtggttg 

aataaagatt 

aggttttttt 

gtgtttgaga 

ttttaagaag 

ataagtagtt 

attattttcg 

gttttcgtcg 

tttttttcgt 

gtacgacgtt 

cgggttggaa 

ttcgcgcgat 

tagcgggtgt 

gttcgtgttt 

gtttcgattt 

gtcgcgtaat 

tttcgtagag 

atttcggacg 

ggaggttagt 

gttcggtttg 

tttttatttg 

ttattgcgga 

attttgggtg 

gttttgttat 

tggtattttt; 

tagggtttag 

gtatcgtgag 

agttattgtt 

aggtttgatt 

ttttattttt 

tttgaggggt 

ttttttcgtt 



ttagcgggga 

agtaattttt 

cgtgtatatt 

taataaaatt 

ttataggtgg 

agaggtaagt 

ggttaaggta 

atttttgttt 

cgtatttttt. 

attttttttg" 

tttcgtaggc 

ttagatgaag 

gcggtggtta 

gcgtagcgcg 

tagttttcgt 

cgttggtttt 

tagcgttttt 

ggaaatttgg 

tttttttcgt' 

gttataacga 

tttattgtgt 

ttgaagtaat ' 

gggtgaggag 

gattggggtt 

gatttttttg 

tttgggttgt 

tgtaggtagg 

aggtttaagg 

tagcgtttag 

tttagtatat 

ggttttggaa 

tttgttattt 

tgaatatatg 

ttagtcgttt . 



<210> 15 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 15 



9000 
9060 
9120 
9180 
9240 
9300" 
9360 
9420 
9480 
9540' 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920- 
10980 
11001 



ttttagattt 
attgttaagt 
gcgcgaaggt 
tgtgggtggg 
gggttgagat 
tttattgtag 
gtaggtgagg 
tattgttatt 
.tatggaaaga 
ttgagggtcg 
ttacggttta 
agttgttgat 
tttagtgggt 
tgtggggatt 
cgtggtgttt 
aaggagggaa 
cgggttttat 



tagaaatttg 
attagtttta 
aaggtttgtg 
gtgtttttat 
ttataggttg 
tgtagaggtt 
gtattttaga 
tttaatattt 
tttgtgtttt 
attatttatg 
gtttttgtcg 
cgattttttt 
aggttaagtg 
tttttttttt 
tgcggtcgtc 
ggaagggtaa 
agtttttgta 



ggagcggttg 

gtatgtgttt 

gaaatggtag 

tttttagggt 

gatgtgttag 

tttaagcgtt 

attttggatt 

atttgtttgt 

ggtagtttag 

ttaagggaat 

gagttttagt ' 

tttttttatt 

tgttgtttta 

agtatagtaa 

gtcgttgtgg 

ggcggggggg 

tttaggtttt" 



gagcgagaaa 
agttttttag 
ggagggtgga 
tagttagatt 
aggtagtggt 
gtttacgatg 
tttaagtttt 
agggagtgtt 
ggtgatagag 
ttatttagaa 
tttcgtagtg 
ttaagtgaag 
gtaaatcgga 
agttggtttt 
tcgttcgggg 
gttttgcgag 
tattgcgcgg 



atagaggtaa 
agtaggattt 
ggggatgtag 
tttttgattt 
tttagagcgg 
ttagaatgag 
atttttatat 
aagttaagta 
ttaaatgagg 
tgtatttttg 
gagagtagag 
gttcgagatt 
ttaggagggt 
tagaaatacg 
tggggtgtga 
agcgcgttta 
ttttttttag 



gtggtaggta 

gcggttgtag 

gaggtatgga 

ttttttaggt 

tttcgttgtg 

tggtattgtt 

tttttatatg 

tt^ggaaaag 

gttgtagttg 

aattttaaga. 

c 93gcggtaa 

ttttgtttta 

tagggtcgga 

ggta,ttttcg . 

ggaggggacg 

gtttcgtttt " 

ttttttttcg 



60 
120 
180 
. 240 
• "300 
360 
420 
480 
540" 
600 
660 
720 
780 
840 
• 900 
960 
1020 



:22- 



tcgtttagtt 
ggttcgcgtt 
ggagtttgag 
tattcggttg 
acggtagttg 
gtggtgcgat 
gcgtgagtag 
tcgtcgcggg 
cgttattttc 
attcgtattt 
ttaagttttt 
tttcggagtt 
ttttagggtg 
ggtgggtttt 
atttagtgaa 
ttttgggaat 
ttaggggtgt 
aattgttggg 
tatggagttt 
gttttaggtt 
tagttggggg 
gtggtttatg 
taggagttta 
aaaaatgagt 
gggagaattg 
ttttagtttt 
taaggttggg 
aggatgattt 
ggtttttgtt 
gtaagtgtag 
cgttttttag 
tagtgtttta 
attgagttta 
gtaggagttg 
agtaagtttt 
ggtagtatat 
ttggttggag 
gttttggttt 
gtttagtttt 
tcgggtgcgg 
atttgagatt. 
aaatataaaa 
tgaggtagaa 
attgtatttt 
tgttaatgag 
ttttttgttt" 
gcgtagtagt 
ggtttttttt 
cgttttatcg 
gttcgtttgt 
ggatacgaac 
tatagtgtag 
cgatttttcg 
tgtcggtttc 
ttcgcgtacg 
. tgcgtttgtt 
ggagtttcgg 
tttttgtttt 
gcggcgcggt 
tcggggcggt 
gggcgaggcg. 
ttgtagttaa 
agttcgtagg 



tggattttgg 
tgttagcgtt 
tttattgagt 
gagcgtgtta 
gtttttggtc 
ttttgtggcg 
tggtttcgcg 
ttaagtcgcg 
gttttatcgt 
gggttataga 
tatttttaaa 
tattttttta 
aagggaaatt 
ttgtaagtag 
ttagatgtta 
cgttagtaag 
ttttatagtt 
agtggatttt 
gggttgagta 
tgtgtgtgta 
gagtataaag 
tttgtaattt 
agattagtt't 
taggcgtagt 
tttgaatata 
ggcgatagag 
atattgtagc 
gggtatgttt 
tttagtaatt 
tagtgaggtt 
atattttagt 
gtcgtggatt 
gattttatcg 
tagggggtag 
aaatgatgtt 
agatcgttga 
tgggtagagg 
gaggggagta 
ttttttaagt 
tggtttaagt 
aggagtttga 
•attagttggg 
gaatcgtttg 
agtttgggtg 
gaaaggggaa 
gtggtttttc 
gtgaggtaat 
ttggggtttt 
ttaggttttt 
ttggattgtt 
gtggtgagcg 
gtattttcga 
gcggtataga 
gtaagtattt 
cgtatttttc 
cggacgttgg 
agcgatgagg 
aatgagataa 
tttggttcgg 
ggttggcggt 
gaggcgtttt 
gaggattcgg 
tcgcgtcggg 



gggaggcgtt 

taaagttagc 

tgcgggagtt 

acgcgttgcg 

gtggttatcg 

attttatttg 

cgtttacgag 

gtagaggggg 

tgaagagtgc 

tgagtagagg 

atgttttggt 

ggtttgtttt 

attatttatg 

gagttttgtt 

aaatatgta'c 

taaaggttgt 

tttttcgtta 

aggtattagt 

ttattgtttt 

tggagatttt 

ttgtgataga 

tagta.tttcg 

ggttaatatg 

ggcgggtgtt 

ggaggcggag 

tgagattttg 

gtttttaaag 

attagaggta 

gggagtttag 

gaagaaaagg 

gaaggtattg 

tggggtagat 

tgagatatgg 

gaggaaaatt 

gagggggtga 

ggattttgtt 

ggagagtgaa 

gagataggtg 

aaatgggtag 

ttgtagtttt 

gattagtttg 

tatggtggcg 

aatttaggag 

ataaaagtaa 

gttttgtgtt 

ggtcggtttt 

ttattgagat 

tttttaattt 

ttgtagattg 

gcgggtttcg 

cggggtcgag 

gggttgtttg 

gaggttaatt 

tattcgttgt 

gtattttttc 

tattcgtttt 

ttatttttgg 

gagttagatt 

gcgtacgcgt 

tacggacgcg 

ttttcgaaat 

attcggagtt 

attaggtggg 



gaagtcgggg 
gaagtacggg 
ggtattcgtt 
tatcgcgcgg 
tttttagttc 
gggcgtcgtg 
agcggaaggg 
tcggcgggga 
gcgaaaatgg 
tggttgttta 
tttttttgag 
tttttaaata 
ggagggagtt 
gagtttttat 
gtagttatat 
ttttttttgg 
gtttggattt 
aaattttaaa 
ggtttattta 
tttattcggt 
atgtaaagaa 
gaaggcggag 
gtgaaatttt 
tgtaatttta 
gttgtagtga 
ttttaaaaaa 
agaaataaag 
gagataaggg . 
tggttatttt 
gaattaaatt 
gtacggagga 
gaatgtagat 
aagttgagtt 
gggagagtgt 
gaatggagaa 
tcgggtagtt 
ggcgaaggat 
ataattatag 
atgtatttta 
agtattttgg 
gttaatatgg 
ggcgtttgta 
gcggaggttg 
gacgtagttt 
aggcgataga 
tagacgttta 
aggtcgggtt 
tcgggatat't 
taagtttatt 
ggatttgggt 
ggcgtatggg 
ggtgtcgcgc 
ttgcgcgggg 
aagcggattc 
gttttcgttt 
cgttttttgt 
gggcgaagcg 
tcggcgattt 
atattgatac 
taggattggg 
gatttggagt 
cgagtagtat 
agttaggggg 



ttcgttttgt 

tttaatcggg 

gggcgcgttg 

ggtatcgcgt 

gcggggttcg 

cgtaaaggtt 

gtagttaagg 

tagttttcga 

tttatttttt 

tatgtaaaaa 

aaagggtttg 

tttatgattt 

tggaaaaatt 

ttagtaaata 

atttagtagt 

gtagatatta 

tatcgtagat 

aattttttaa 

ggaatttgtg 

ataagaggat 

tgaataaggg 

gcgggtggat." 

atgtttatta 

gttattcggg 

gtcgagatcg 

aaaaaaaaaa 

tagttatgga 

agaaattaaa 

tgttgtaaag 

ttggttatgt 

tttagtttga 

agattaggtt 

agaatttgta 

agtt'tttggg 

tggaatattg 

ttttggagga 

tagagtgtat 

ggtagacgta 

tatacgtttt 

aaggtcgagg 

•tgaaatttcg 

attttaggta 

cggtgagtcg 

tttgttgttg 

gatttaattg 

ggtggttaat 

tgcggagttt 

ttttcgattt 

tgttattatc 

tgggaattcg 

aagggcgagg 

gtaaggagcg 

gttgggaggg 

gggtttaggt 

tcggttagag 

ggtaggtggg 

tgcgtgtttt 

acgttttagt 

gcgtatacgt 

ggacgggcgg 

agtacgacga 

tttatcgcgc 

tgtcggcggg 



ggtttcgttc 

ttatgtcggg 

ggaagggtcg 

gtaattttat 

ttacgtatac 

tgtagtgcgc 

ggtagcgtag 

ggattaggtt 

gtcgtatttt 

tacgttgatt 

tgtttattgt 

tttttagaat 

tagaattttt 

tttttttttg 

ttttttgtat 

gttggaatta 

ttgttgaatt 

attattgtaa 

gatggatagt 

attataaatt 

gtcgagcgcg 

..tatttgaggt 

aaaaataaaa 

aggttgaggt 

tgttattgtt 

aaaaaaagaa 

gataagaagt 

gataagtttg 

aggaagttgg 

ttatttgaaa 

gggtttaggt 

agtgattagg 

aaggagttga 

agttaaaggg 

gattttattt 

agaggtaagt 

agagattagt 

ggttaaaggt 

tagtgaaggg 

cgggtggatt 

tttttattaa 

tttaggaggt 

aaatcgcgtt 

tttttttaat 

ttgagtaggt 

attagagttc 

ggcgagtagc 

ggagtttttt 

gttgtcgcgc 

cggtggagcg 

atgggtaggt 

ttttaattgt 

gagtttggat 

tgatttaggt 

gttatttttg 

gtttgtgagt 

cgtttcggcg 

tttaacggtt 

acgtacgcga 

gtacggttat 

gtagtggtta 

gaatttcgt't 

cgggagggga" 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 • 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
* 3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380* 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
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agcggtcgtt 

tttatttttt 

gtgcgtaaga 

ttatagatag 

tattatttag 

ttcgaatgtt 

ttcgttagga 

ttttatttgt 

ttattagttg 

gatttgggaa 

atttatagga 

aggttattaa 

gaattgtgtg 

taggaaaaag 

tttatttaag 

ttttataaaa 

gttgagttga 

tattgtggaa 

"ggaggtgtat 

ttgggaggta 

gagtgagatt 

tgttaatgtt 

tttggttaat 

ggtgtgtgat 

gttgtagtga 

ttttaaaaaa 

ggaggttaag 

gtaaaatttc 

gtaattttag 

ttgtagtgag 

tttaaaaaaa 

ttaaaaaa'aa 

ttagtatttt 

gttaaaatgg 

aggtatttgt 

gtagagattg 

attttgtttt 

ttttatggtg 

gaaatttttt 

ttttggaagg 

gtttattatt 

tgggttattt 

ttattataag 

tttttttttt 

tt'atttgggt 

gtttttagtt 

tttattgttt 

gagatttttg 

ttttttttgt 

gtagcgggtt 

tttttttatt 

gggagaattt 

atttttttgg 

tttaattagg 

gtttgggaga 

ggtaagaatt 

tcgggtacga 

atttgaggtt 

aaatataaaa 

gtttgaggta. 

gttattgtat 

ttgggttaaa 

tatttttggt " 



ggagtttcgt 
tggttacgtg 
tttcgatttt 
gttgaatacg 
ttatgattaa 
tttgtttttt 
attgagaagt 
atttttgtat 
ttggaatgtg 
tagataggaa 
agggggaaag 
tttaggattg 
ttagatgtta 
aaatagggat 
taaagaaagt 
tgggtggtgt 
gtgtagattt 
aggtgggaaa 
atttgtaatt 
gaggttgtag 
ttgttttaaa 
ttgggaggtt 
atggtgaaat 
tgtagtttta 
gttaagatcg 
aagaaagaag 
gtaggtagat 
gtttgtattg 
ttattgggga 
tcgagattgg 
aaaaaaaaaa 
agaaaaaaat 
ggaaagtcga 
tgaaatttcg 
aattttagtt 
taatgagtta 
aaaaaaaaaa 
tagaagggaa 
tttagtaggt 
gttgtgtagt 
agaagggaag- 
aggaagggaa 
aataagttag 
gttttggtat 
attttttttc 
tttatttttt 
ttagtaaggt 
gatatttggg 
aaggttttgg 
ttgggtaagg 
tgtttttatt 
tggttttttt 
gataaagtat 
gtttatttga 
agaaagtcgg 
tagtttttag 
tggtttacgt 
aggagtttaa 
attagtcggg 
ggagaatcgt 
tatagtttgg 
tttttagata 
agattggagt 



ttttttcggt 
gtttttcgcg 
atcgtttaga 
gaaaaagtag 
agtcgatcga' 
ttttggggag 
atgtttagga 
ggagttagtt 
gaagattttg 
agaagttttg 
ttagattata 
taaggtgttt 
ttggtgtgat 
tgtgaagttt 
ggagttgtga 
ttagtacgtt 
ttgggtttgt 
attataagat 
ttagttattt 
tgagttaaga 
aatagaaaag 
aaggtaggtg 
tttattttta 
gttatttggg 
tgttattgta 
gttgggtttg 
tatttgaggt 
aaaatataaa 
ggttgaggta 
gttattgtat 
aaaagaaaga 
attagtttta 
ggtgggtgga 
tttcgattaa 
atttgggagg 
agatcgtatt 
ttattagttt 
attttatggt 
agttttggaa 
gtttttaatt 
gggttggaaa 
gaaggagtaa 
tattattttt 
tagagtttgt 
gttggtgtat 
agggtttata 
ttgtattggg 
gaagaaaatg 
ttttagtttt 
agtattttgt 
agatggttta 
gggttaggta 
tggttagcgt 
cgttaatagg 
tttagagttt 
atgaggtggg 
ttgtaatttt 
gattaatttg 
tgttgttgtg 
ttgaatttag 
gcgatagagt 
gtataggtag 
ttggtttaaa 



tcgttgtcgc 
ggttttggtc 
tgttgggtgt 
ttgtatggtt 
1 gtagtttgga 
attaggggag 
aaatttgaga 
gagtttttat 
tttttttttt 
ttttggatta 
aaatgtttaa 
cggaattttt 
tattaggttt 
agtatgtttg 
gtgatgtttt 
aaaattttag 
ttttatgtta 
attaattaat 
gggagggtga . 
tcgtgttatt 
gaagttaagt 
gattatttgt 
ttaaatatat 
agattgaatt 
ttttaatttg 
gtgatttatg 
taagagttcg 
aaaattattt 
ggagtattat 
tttagtttgg 
ataggaggtt 
ggttaggtgt 
ttatgaggtt 
aaatataaaa 
ttgaagtaga 
attgtatttt 
ttatggatag 
tttgttgtgt 
atagaaaaag 
taagtttgtt 
tgaggttatt 
gtttttttat 
gtgttttttt 
ttggtatttt 
tgaataaata 
ttatttgttt 
gtttttattt 
agtttaaatt 
tagttttat'a 
taaaacgttt 
agtgtttaag 
taagattatt 
tga.gtttagt 
aagtaaggtt 
tggttgtttt 
gagtgagtgg 
agtattttgg 
gttaatgtgg 
gtacgcgttt 
gaggtagaat 
gaggtttcgt 
gtgtagaaat 
gttgtttttt 



gttttgggtc 
ggggatttgt 
. cggggttttt 
tgtggtagat 
ttagtatttc 
gatgtggaga 
ggtagagaga 
tttttttttg 
ttagggtgga 
taagtattta 
agaggtggaa 
taggtatttt 
agagaattag 
gtagaaatgg 
agataaaatt 
tttagagttt 
gttagttttg 
tgaaaaggag 
, ggtagaagga 
gtattttagt 
acggtggttt 
aattaggaat 
aaaaattagt 
attttaatcg 
ggtgataggg 
tttgtaattt 
agatttgtta 
ggttatggtg 
ttgaatttag 
atgagagagt ' 
gagaagtttt 
agtggtttat 
aggagtttaa 
aattagttag 
gaattgtttg 
agtttggaaa 
tggtagagtg' 
attcgattgg 
aaattttttt 
ttttgagtga 
gtattttagt 
tgttaggtag 
tgttttgtaa" 
ttttgttttt 
tatttattgt 
ttattaattc 
tagtgttttt 
tttatttttt 
tttttgttgg 
tattttgttt 
gggattttag 
ttataggaaa 
tgtgtttgtg 
gatgtagtgg 
gttttatatt 
tcgagttaaa. 
gaggtgaagg 
tgaaatttta 
gtagttttag 
ttgtagtgag 
tttaaaaaaa 
ttattaggaa 
atgtagtttg 



ggtgggtagt 

tcgcggaatc 

ttggtttttg 

ttgagtcggg 

gattttcgcg 

gggaagagtt 

gattttgttt 

ttttggtttg 

tttggagaaa 

ggagtatttt 

aaagagattt 

tattatcgga ' 

gtttaggtat 

ggcggaaatt 

ttataaaatt 

gggtgtaagg 

agttattttt 

ggttagttac 

ttatttgaat 

ttgagtgata 

atatttttaa 

tccjaggttag 

cgggtatggt 

ggaggtaaag 

tgaggttttg ' 

tagtattttg 

ggttaatata 

gtgtgtgttt 

aagatagagg 

aagattttgt 

aagttatatg 

atttttaatt 

gattagtttg 

ttgtggtggt 

aatttaggag 

atagagcgag 

gagggtgggt 

gatggttgtt 

tt.ttttagaa 

agatagggag 

ttagggtttt 

gagtttagag 

ataaaatgat 

agtatttttt 

tttatttata 

gataaggttg 

ttttatttag 

tttttttatt- 

ttgtagaata 

ttttatttgt 

ggcggagtta 

ttttgtggga 

atattcgtat 

ggttaaggga 

ttattttttc 

aatttttggg 

taggcggatt 

tttttattaa 

ttattcggga 

ttaagattta 

aaaaaaattt 

gttgtttgtg 

ggttaaggtt 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
• 5640 
5700 
5760 
5820 
5880- 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 



24-: . 




aaatattatg 
tattattttg 
ttgtagtggg 
ataaattttt 
ttttattgat 
cgatggtacg 
tttagttttt 
tttttagtag 
ggtgatttat 
. tagtttgttt 
tgtagtgata 
• attttagttt 
tttttttttt . 
gaggattaag 
attatattta 
tttggttttt 
tttagtgttg 
atttatttag 
tatatgtata 
ttatttttta 
taggacgagt 
attaaggagt 
tttaggaagg 
tataattgag 
taagtagatt 
gaataaggac 
ttttgtgttt 
gggtataagt 
gtatgtgttg 
ggtggtggat 
tgggtttttt 
ggggttgagt 
agttgttgga 
ttttgagttt 
tagttttttg 
aggaattgtg 
ttaaagtaaa 
aagaattgtt 
tttatttaat 



aaggtattag 
attggggaat 



ttatagtgat 
tttattggtg 
ttatggtttc 
attttaattt 
tttttttttt 
attttagttt 
taagtagttg 
agacggggtt 
tcgtttcggt 
ttgttttttg 
cgataatagt 
tttgaatagt 
. ttagtagaga 
tgattttttt " 
gtttgatttt 
ggttatgtgt 
aggttttagg 
agtatgtaaa 
ttagggatag 
gagaagtttt 
ttgtggagtg 
ataatgttta 
.gggcgtgggg 
gttattttgg 
gatattttat 
ggtttttata 
tttagtaaga 
gttaggcgtc • 
ttacgtataa 
gattttcgta 
attttatttt 
agttggtttt 
tgatgagtag 
tgttttgaag 
gttgttatta 
ttggaagata 
ggtttttaga 
ttggtgtttt 
gatatgagga 
gtttgtggag 
atttgatttg 



tttttttatt 



gttggattat 
gtgtcggggt 
aattagggtg 
tttgagatcg 
attgtaattt 
ggattatagg 
tttttatgtt 
tttttaaagt 
tgttttgaag 
ttattgtagt 
tgggattata 
tgaggttttg 
attttagttt 
attttaaatg 
ttttagttgt 
taggtagtag 
attaggagta 
ggggttaaag 
gggtttttta 
ggagatattt- 
gattaatagt 
aggagtaggt 
gtgggcgtaa 
ttggaattta 
taggttttat 
agttattttt 
•gtattttttt 
gggtacgtga 
agtttgtatt 
gttttatgtg 
gtttttgatt 
tttttgcggt 
gaaaatgatt 
gataggattg 
agttttgtaa 
ttatttatgt 
gatttttttt 
ggtttttttt 
tgaaattaga 

g 



atgtgtgaga 
tgatgtgtat 
agaggagaaa 
ttgtgattag 
agtttcgttt 
ttatttttcg 
tatgtgttat 
ggttaggttg 
gttagaatta 
tagggtttta 
tgtaattttt 
ggtatattat 
ttatgttgtt 
tttaaaatgt 
agagtttttt 
ttttgttttt 
agagaagtta 
gtggtgggtt 
gtagttagtt 
agggttttgt 
' gatttttttt 
aattttttta 
atttagttat 
gtagtaattt 
ttattaaggt 
taaggtttag 
tttgtaggat 
ttatttgttt 
agttattgag 
ttttgagcgc 
agggtatata 
attatttttt 
tgcggttttt 
ttggggaggt 
atgggttgta 
aatagtttta 
taaagtttag 
ggtgtggaaa 
agatttttta 
tttagaatat 



tatggagaat 
tattttttat 
aatgggttgt 
aatgtgtaat 
ttgttgttta 
agtttgagta 
tacgtttggt 
gttttaaatt 
taggcgtgag 
tttagttttt 

cgggtttaaa 

tatatttggt 
taagttggtt 
tgggattgta 
tttagagttt 
gtggtatttt 
aatgattttg 
tagggtgggt 
tttaaagatt 
gtttatgttg 
aagttgagat 
tgagtttggt 
gtgtttgata 
gtgtatattt 
ttggtttttt 
ttgaagttgg 
gttcggcggg 
aaggatgttg 
gttttgttgc 
gttgagcgtt 
cgtatgtatt 
ttttatagtg 
ggtagggttt 
gaacgtgagt 
gttatagtaa 
ggagtgtata 
ggttgtttta 
attttttgga 
tttggaagtt 
gtttgatttg 



tggttttaag 
tttttttatt 
tttttttagg 
tgaggtgtga 
ggttggagtg 
atttttttgt 
taatt.ttgta 
tttgatttta 
ttaacgtgtt 
taggttggag 
cgattttttt 
taattttttt 
ttaaattttt 
gatgtgagtt 
tttagttgtt 
taaggttata 
tttttttttt 
attagttatg 
gttttagagg 
gtttattttg 
tgagtagaag 
gagttgattg 
tttagagggt 
agtttagttt ■ 
tattttttta 
tgcgttttgt 
gtttaggacg 
ttaagtattt 
gaaagttttt 
acggttaagg 
tgagtatgta 
tatttgcgga 
agtgataagg 
atatagtttt 
ggtttggagg 
ggtattgtaa 
agaagttagg 
gatgtaggag 
ttttggtttt 
tttataggta 




<210> 16 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo* sapiens) 
<4 00> 16' 



8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180' 

9240 

930.0 

9360 

9420 

9480 

9540 

9600 

9660 • 

9720 . 

9780 

9840 

9900 

9960 

10020 
. 10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11001 



gtggggttgg 
' attggttaag 
atgggttgtg 
aggggagttt 
ttgtgagttg 
tttttttaag 
gaggtgttgg 
gtttgggtta 
.ttttttggga 
ttttgatata 
ttgttgtttt 



taggggtgtt 
agttgttatt 
gggttttgtg 
agggtttggt 
agggtttggg 
tttggggtgg 
gtttgtgtgt 
gggttgtagt 
gttgtgtggt 
atgttttttt 
tgttttttgt 



gttttggtat 
gttgtgggga 
ggttaggagg 
.tttgggtttt 
gttggtgtat 
tggagttttg 
gtagggttta 
tggttgtgtg 
tttttttttt 
tgtgttgggt 
gtggttgttg 



agtttggggt 
ggggagtttg 
ggaatggggt 
gttgttggat 
tttttttgtt 
ggggagaagg 
ggttgaggtt 
ttgtgtttgt 
tttttatttg 
ttggtttttg 
tttgggtttt 



ttggtagtgg 
gtttgttggg 
tgggtgggtg 
ttgggggttg 
ttgtttgtag 
ggttggggga 
gggatgtggg 
ttggggtgtt 
ttttttgttt 
tgttttgttt 
gattgagggt 



tgggtggggt 
attgtaggta 
agtagtgggt 
tgaggaagag 
ttggaaaatt 
gttgtggagg 
tggggtgtag 
gttttttttt 
tagttttttg 
gttatggtag 
tgatagtttt 



60 
120 
180 
240 
300 
360 
420 
.480* 
540 
600 
660 



— • — 

• • • • 

-:2s- 



tggttagggt 
tggggtaatt 
ggtaaagttt 
attgttttta 
gttaggtgtt 
ttttgttttt 
gttttttgtt 
gtggagggga 
ggttggtgtg 
tgtttggtta 
tgtgtgatgt 
tggttagtag 
atgatatggt 
tggttgatgg 
tgggtgttgt 
agaaatgttt 
gtggttatgt 
aggtttgtta 
gtttagaggt 
ttgttgtatt 
agaagagttt 
tgtgtttgga 
tgttgttgtt 
tattttgtgg 
gtttttttta 
gttgggagtg 
tgtttggggg 
ggtttggtgt 
gtgaggattt 
atggtgagtt 
tttttagttt 
tgtatggtag 
ttggaggtgg 
tttattgttg 
agaagatgta 
agtgtgggat 
atgtgtgtgg 
tgggtgagaa 
ttattagtta 
tgggtttggg 
ttgagggtgt 
ataaggtggt 
tggtgttgga 
ataaggtggt 
ttattgattg 
gtggttttgg 
ggataattgt 
ttgttttgtg 



ggtgttaggg 
ttttttgagg 
tttgttttga 
aaagggttat 
ttggggtgtg 
ttttttggtt 
ggtgtttagg 
gaagtgttgg 
tgtttttgta 
ttttaggtag 
gattattgtg 
tgtttatttt 
gagtttggtt 
tgtagaggtg 
gttggttgtt 
taagtgttat 
gttttatggt 
tttgttttta 
tgtggttaat 
ttgtgttttg 
gttgggtttt 
tagttttttt 
gttgttgttg 
tggtagtggt 
ttgttggatg 
tggttttttt 
gtttttgttt 
gggtggtgat 
tagtttgttt 
tgagagtttt 
tgagtagttg 
ggttgaggtg 
tggggataag 
tgtgttgtgt 
ttggttgatt 
tatgatgtgt 
tatgtggttt 
gttttatgag 
tatgaagatg 
ggggtttttt 
ttttgttgtg 
tgtggttgag 
gagtttttat 
tgaggtgttg 
tttttttttt 
tttgtatttt 
agtgttgtta 
ttttagtttg 



tgggtattgt 

tgggaggtgt 

atgatttttt 

tgtgtttgaa 

taggttgttt 

tgggt<Jggtt 

ttgtagggtt 

gttggggtta 

ggagagtgtg 

ttgttgttgt 

gtgtagaatg 

aagtttttgg 

gtgttttgtt 

gttgtggttg 

£ttagttatt 

ggtaagtatt 

tggttgggtt 

gttgggtttt 

atgtattgtg 

gagtgttgtt 

gtggtgttag 

agtgttggtt- 

tttttttaga 

agtttgggat 

aagtatgagt 

agtgagtgtt 

ggtttggtgt 

ggtgatgatt 

ggtggttatt 

ggtgataatt 

aatgtgtatg 

gttgaagtgg 

gttgttgggg 

gataagagtt 

tggttttatt 

tatatgtgta 

atgtgttagt 

tgttaggtgt 

tatgttgtgg 

ggtgtttttg 

gtttgtttta 

ttgttggtgt 

ttgttggtta 

agttagggtg 

atttagagtg 

agggagtggt 

tagtggtggt 



gttttttttt 

tggttttttg 

ttgaagtgga 

tagttttttt 

tggttttttt 

ggtttgggtt 

gatgtttttg 

ggtggttagg 

ttgggtagat 

agtttaataa 

tttttttttg 

tggtgtatga 

tggtgttgga 

tggttgtggt 

tgtagatttt 

gttatttgtg 

ggggtttgtg 

tgttgttgtt 

ttgagttgta 

gttttttttt 

agtggttgtt 

ttgttgttta 

agttggagga 

ttgagttttt* 

tgggtttggg 

gtgaagagtg 

tgttgttgtg 

ataagagtag 

ttgagggtta 

tgtatgtgtg 

tggaggttta 

ttgttggggt 

ttttgggtgg 

ataaggattt 

tatgtattat 

gttatttggg 

attgttttat 

gtggtggtaa 

ggggtgtggt 

gttttgatgg 

tggttgagta 

agattatgta 

agtttatggt 

tttatttggt 

ttttttgtta 

gggggtggtg 

tttatttttt 



tttgtattat 
gttttttttt 
tattttattt 
tttggaagtt 
tttattggtg 
tttattttag 
tttagttgag 
gtgttgtatg 
gatgttggat 
ttagtgtatt 
tgtgtataag 
taatttgttt 
ttttatttat 
tttgggggtt 
tgattttgtg 
gggtggtggt 
ggttgttatg 
tgttgtggag 
tgtgttggga 
ttgtggtttg 
ggttgagtgt 
taaggagttg 
ggttgtattg 
tggttgtttt 
tagttatggt 
tggtggggat 
ttattttggt 
tagtgaggag 
tttatgtttg 
tattttgtgt 
tgtggaggag 
tgttggttta 
tttgggagag 
ggttatgttg 
ttgtgggaag 
ttttaagttt 
ggagtatatg 
gtttgtatag 
tggtgtggtt 
taagggtaag 
gttgagtttg 
ttttttgtat 
tgagttgggt 
ggttgggttt 
gtttgttttg 
tgtagggttt 
ggtggtttta 



tttttttaat 

tttttatttg- 

aaattgggta 

ttagtattta 

gttgtttatt 

agggtagttg 

ggaaggggaa 

gtttttattt 

atgatggagg- 

aagggttttt 

aatgtgttgg 

aatttggatt 

attggttgtt 

gagttgagtt 

gtgttgtgta 

ggtggtggtg 

ttggttattt 

ttgtttttgg 

tttggtttgg 

gatttgttta 

gagttgtttt 

ttttttgttt 

ttttttgatt 

gatgggttta 

gatgagttgg 

gtggttgttt 

agtttggatg 

attggtagta 

tatttggttt 

ggtaagggtt 

gaggaagtgt 

gggttttttt 

ttgttgtggt 

tggtagtatg 

aagtttatgt 

tttgtgtgtg 

tgtatttatt 

taatgtaatt 

ggggtgttgg 

tttgattttt 

aagtagtagg 

gattttaagg 

tttagttttg 

gatggttgga 

ttgttgttgt 

attgtgtttg 

tttggtttta- 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
'2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3501 



<210> 17 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 



.<220> • 

<223> chemically treated genomic DNA (Homo ' sapiens) 



<400> 17 

ttgagttaag gtatgaagta gtgaggttag 
gtggtgatgt tgtggttgtt ttgggtatag 
tggggtgtgg gttagggttg tgtagtagtg 
tgggagagaa atggttgatg gtttggttgt 



gtgaggttgt tgagaggtgg agttgttatt 

-tgggttttgt gtgttgtttt tgttgttttt 

atagagtggg ttggtgaggg gtgttttagg 

tgggtttggt tgttaggtga gtgttttggt 



60 
120 
180 

240 . 
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ttagtatttt ggttgttttg ttggggttga ggtttagttt ggttgtgaat ttggttagtg 300 

ggtagaggtt ttttagtgtt attttggggt tgtgtaggaa gtgtgtggtt tgtgttagta" 360 

gtttggttgt ggttgttttg ttttgttgtt ttaggtttag ttgtttggtt gtgaggtgag 420 

ttatagtaaa gatgtttttg gggaagttga gtttgttttt gttgttgggg ttggggatgt 480 

tggggagttt ttttaagttt gttagtgttt . tggttgtgtt ggttgtgttt tttatggtgt 540 

gtatttttat gtggttgatg aggttgtgtt gttgtgtgaa tttgttgttg tatatttggt . 600 - 

atttgtaggg ttttttgttt gagtggatgt gtatgtgttt tgtgaggtgg tattggtgtg 660 

tgaattgtat gttgtatgtg ttgtatgtga agggtttgag gtttaggtgg ttgtgtatgt 720- 

ggtgtgttat ggttttatgt tgtgtgaatt tttttttgta gatggtgtat gggtagggtt 780 

gggttagtta gtgtgttttt ttgtgttgtt gtagtgtggt tgggtttttg tagtttttgt 840 

tgtatgatgt gtagtggtag ggttgtagta gtttttttag gttatttgga gttttggtga 900 

ttttgttttt gttgtt'ttta -aaagggggtt ttaggttggt ggttttagtg gttattttgg 960 

ttgttttggt tttgttgtat agtgtttttt ttttttttat gtgagttttt atgtgtgtgt 1020 

ttagttgttt agagttgggg aagtttttgt tgtatggaat gtatatgtat aggttgttat 1080 

tgaagttttt gggtttgtta taggttaggt gtgggtatgg gtagtttttg aggtggttgt 1140 

taggtgggtt ggggtttttg ttgttattgg tttttttgtt gttgtttttg tagttgttgt 1200 

tgttgttgtt tgtgttgggt ttgtttaggt tgttagggta gtgtggtggt ggtgttaggt 1260 

tgagtggggg ttttttgggt gagatggttg tgtttttatt atgttttttg tagtgtttgt * 1320 

" tgggggagtt gtgtttttgg tttagtttgt "tgttatagtt atttaggttt ggtttgtgtt 1380 

ttatttagtg atagaggaga ttaggtttgt tggggtggtt ggggggtttg ggt'tttgggt " ■ 14 4 0 

. tgttgttgtt gttgtgaaat gggttggaag gtggtgtggt tttttttagt ttttggaagg 1500 

•gtagtggtgg .tagtgatggt agggtgagag gtggtttttt gtaggtggtg. gggttggtgt ' 1560 

tgggagggtt gtttgggtgt gggggtagtt tgtgtttagt tagtggttgt tttggtgttg 1620 

tggagtttgg tgggtttttt ttggataggf ttaggttata aagaggggag tagtggtgtt 1680 

ttgaggtata gagtgtggtg gttgggttgg gttttgatgt gtatagtttg' gtgtagtgtg 1740 

tgttgattgt ggtttttggg tttgagggtg gttttgtggt aggtggtggt ggaggtttga 1800 

ttggggatgg gtagtaggtt tggatgattg gtgtggtggt ttgtaggttt tggtttggtt I8 60 

gattataggg tgtgtagttg ttgttgttgt tgttgttgtt ttgtaggtgg tagtatttgt 1920 

tgtggtgttt gaggtgtttt ttgtatagtg ttatgaggtt ggggatttgt aggtagttgg 1980 

tggtggttag tatggtgttt aggtttggtt tagtttttgg ggttatggtt gtggttgtag 204 0 

ttgtttttgt gttgttagtt aggtggttgg tgtagatgaa gtttagtatt aggtggaata 210p 

tggttgggtt tattatgtta tggtttaggt tgagtaggtt gttatgtatt attagggatt . 2160 " 

tgaggtaggt gttgttggtt gttagtatgt ttttgtgtgt gtggaagagg gtgttttgta 2220 

ttatgatgat tatgttgtat aagaagtttt tggtgtgttg gttgttgagt tgtagtagta : 2280 

gttgtttgga gtggttgggt gtttttattg tgtttagtat tgtttgttta gtatattttt 2340 

ttgtggggat atatattggt tgggtgagag* ttgtgtggtg ttttggttgt ttggttttag 24 00 

tttggtattt ttttttttta tttttttttt ttttagttga gtgggggtat tagttttgtg 24 60 

gtttgggtat tggtgaagga ttggttgttt tttggagtgg gagtttaggt tggtttgt'tt 2520- 

ggattaggag aaggagtagg aggtgagtgg ttgttggtgg aggggaggtt agggtggttt' 2580 

gtatgtttta gggtatttgg ttgggtgttg gggtttttga gaagaaaatt gtttaggtgt 2640 

. . " agtgattttt ttggagatag ttatttgatt taagtaaaat gtttgtttta ggaaaagtta- -2700 

^^^^ tttagggtgg agaattttat ttaagtaggg agaaagggag ttgaggaatt agtgtttttt 27 60- 

gttttgggag aagttgtttt agttggggga agtgatatgg aggaggggag tgtggtgttt 2820 

gttttggtgt tgttttggtt gggggttgtt aatttttggt tggggtttgg gtggtggttg 2880 

tgtggggagt ggaggtagtg gttgttgtgg tgggtagagt gtgaaggttg ggtttggtgt * 2940 

. ggggagggtg ttatattggg gtaggaggtt gaggtaggaa gtaggtgggg gggagggggg - ,3000 

agttatgtag tttttagggg agggaggggg tagtgttttg ggtgggtatg gtgtatagtt • .3060 

ggttgtggtt ttgatttggg tttgtgtttt atttgtgttt tggttttggt ttgggtttta 3120. 

tatgtgtggg tttggtgttt ttttttgtgg tttttttggt tttttttttt ttggaatttt 3180 

gttgttttaa atttggggaa aagtttttta attgtagata* gggtgggagg agtgtgttgg 3240 

ttttaggttt ttggtttgta gttttttttt gtggttttta aatttggtgg tagagtttgg . 3300 

agttgagttt tgagtttttt tgtttgttgt ttgtttgttt gattttgttt tttttttggt 3360 

ttgtggggtt ttgtggtttg ttatttgtgg ttttggtggg ttgggttttt ttttttgtgg 3420 

tggtggtagt ttttagttga tgttttattt gttgttgtta ggttttgagt tgtgttaggg 3480 

tagtgttttt gttagttttg t 3501 

<210> 18 • 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 




<220> . 

<223> chemically treated genomic DNA (Homo sapiens) 



.<400> 18 

tttttatagt 
tagtataaaa 
tttttttttt 
gggttgttat 
ggatagggtt 
ggttgtattg 
atttaggaga 
agattattta 
aagtagtggg 
gtagggatta 
gttatttttg 
aaatttagaa 
gtgttgagtt 
aaatggtttt 
ggattttaag 
gatttttgtt 
gttttgtttg 
tgatgggtgg 
ttgtgttttt 
gtagttgagg 
attttttgag 
tttttagtat 
ttgttgattg 
tatttttttt 
tgtgaggggt 
ttatttggtt 
tttttttaaa 
gttataattt 
ataggggtta 
ttggagagta 
gtggtaaaat 
aataaattat 
agtgggtgat 
ttattggtag 
taaatgttat 
taaagtggtt 
attttatatt 
ttttttaaat 
attttataat 
gaggtttagg 
ggggttatat 
tttatatatg 



gtaaatgtgt 
tttaaatatt 
taaatttagt 
tttaggtttt 
tgtttatttt 
ttattgaaag 
tgtttagtaa 
ggaatatttg 
ggtttaggat 
ttatattgat 
gaagtttttt 
aatgtttatt 
aattattttt 
tggaaggtaa 
ggagttgtgt 
ttgtttttgt 
tttgtgtgtg 
gtggtattgg 
tgttgaggtg 
ttattttgtt 
gaggaaagag 
tttggtgtgg 
taaaattggg 
g'taattaatt 
gggggttgga 
gtttagtgtt 
gagggtgttt 
gtttttgaaa 
ggtgtatgat 
gagaattttg 
ggaaaattgt 
ttatattaat 
ttttgtaatt 
tattttatat 
aaattggaat 
ttttatgttg 
tatagttgtt 
gtttttagga 
gtttggtgtt 
gattatattt 
ttgaaaatta 
gtatgtagta 



ttttattatt 
taaatatgat 
tagtgtttgt 
ggtttttttt 
gatattatta 
tttattattt 
atgttgattg 
ttaaaatgtt 
tttgtatttt 
aaatttttat 



aaggataggg 
gagtgaatga 
ttatataagt 
gttttttagt 
gttttgtgtt 
tattttttgt 
tgttgttgta 
ttattggtgt 
tagttggttg 
gaggtggtgg 
ttggaaatat 
gttttttaag 
tgaaagtttt 
tttttagatt 
atttgttttt 
tgtttttttt 
gataagagtt 
tttattggag 
ggagaaagaa 
aagtggattg 
tatgtgaaat 
atagatgttg 
tttaagtgta 
ttggaaaggt 
tttgttaatt 
ttttttagaa 
tggttatttt 
aaatattttg 
ttaaattttt 
taagatgttt 
tttaaggata 
tattttatat 



ttttggagta 
tttgttggtt 
gttgtttgta 
tagaaagttt 
gtttatttat 
gattttgagt 
aattatgatt 
tttgtttttt 
gatagttttt 
tatttttaag 
gttgtatgtt 
tgaaatgata 
tttagtagtt 
gagtggagtt 
tattttttta 
ttttttttgt 
ggttttattt 
gggggtaggt 
ttttttttgg 
ggaggggaat 
ttggtttttt 
gttttgtttg 
ttttgatttt 
tttgtttagt 
tgaagtgtag 
tgttgtaaga 
tttttttgtt 
ttttggtaga- 
aatgggagta 
gttagtggga 
agttttattt 
tgtagtgaga 
gggattttga 
tttaatgtat 
aatttttatg 
aatgttttta 
tgtatgggta 
ttt-atatttt 
tttgatagtt 
ttgttattgt 
gggattgttt 
tagtatttag 



attttattta 
ttgtttttgt 
atgttttttt 
tttttttttt 
atatattggt 
agtttgagga 
gtttttaata 
gagattttat 
tgatgtttgt 
ggggattttt 
gtttttttag 
ggtgaataga 
tttattgttt 
atttttaatt 
ttagaaattg 
tttgtagttg 
atgggtttat 
gagtatgtga 
gttggtatat 
ggttattttt 
aagtaggtat 
agaggagagg 
tatttgtggg 
tttttaaagg 
agatgttagt 
ttttaagttt 
ggagtttgta 
ggttgtaagt 
ggatgggtat 
aagttgtttg 
tttaaagtat 
ttttatatta 
tattatgtat 
gaattatttg 
tatttttata 
gtgttgatga 
tgtatttggg 
gtttttattg 
tttggataat 
taaggagatt 
tttttgatat 
t 



aaattgtttt 
ggttttattt 
ttttttgtag 
. tttttagtgg " 
tataagttta 
ttttattaaa 
tataaatgta 
tttgggaggt 
atgtagaagt 
ttttttaggg 
gttagtaatt 
tgaatgtaag 
ttagttgtag 
atatttttta 
gtaagttatt 
gtgtttagtg 
tgttgaggtt 
aggttggagg 
gtgtgtagtt 
gaggtattgt 
agtttgtttt 
ttaggttggg 
tattgattgt 
attgaaaagt 
ttttgaaaag 
gtgagaggat 
tgtttagtaa 
ttaaatgtgt 
atttgaggaa 
tattttagga 
aaaaaattaa 
gttttttatt 
ttttgatttt 
agttatatat 
tttgtattga 
atagttaagt 
tgtagttata 
taaataatgt 
ttttatgtag 
ttttttttta 
tattagtata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
* 900 
960 
1020 
1080 
•1140 
1200 
1260 
1320 
1380 
1-4 4 0 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280. 
2340 
2400 
2460 
2501 



<210> 19 
<211> 2501 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA .(Homo sapiens) 



<400> 19 

attgagtatt 
gaagtagttt 
agtgat'ggta 
gagttgttaa 
taaagtgtaa 
atgtttatgt 
ttgaaagtat 



ggtgtaaaat 
ttatttttga 
aaaatgtttt 
aaaagattta 
ataaaatgtt 
agagataatt 
tttttgaaag 



gtattgtata 
atgattttta 
agaatgtaat 
aaatattaag 
tttttaaaaa 
agatagttat 
gtagtataag 



ttatgtgtaa 
gatatagttt 
ttttggattt 
tgttgtaaaa 
tatttagaag 
gggtataaaa 
aagttatttt 



gtatgttaat 
ttgaaaagga 
tttgtatgaa 
tatattattt 
gtatgattat 
tatttggtta 
attaatatag 



ggtgttaaaa 

aagttttttt 

aattatttaa 

ataataaaag 

atttaaatat. 

tttattaata 

atatgaaagt 



60 
120 
180 
240' 
300 
360 
420 



-285 




atataggaat 
tatgtattag 
gttaagattt 
attttattgt 
aaa'atggggt 
tttttattgg 
ttatttttat 
ttttgttaag 
-taatggggaa 
attttatgat 
tttgtgtttt 
gattaggtaa 
aagggttagg 
ttaggtagga. 
tgagagatta 
tatttttttt 
tttgaaaaga 
tatttgtttt. 
tgaataaggt 
gataggagga 
gtaggaagat 
tgattttatt 
gagttgttag 
ttgttgtttt 
• tagtatgtgg 
ttttagggat 
aggggattgt 
taaggggtaa 
tagttatagt 
tatttagagt 
tgtaagtgag 
aaaatttttt 
ttataggtag 
aagttagtaa 
ttattttaga 



taattgatag 
ggttttttta 
ttgtatttgg 
atagtattta 
tattttatat 
ttggtttatt 
tttttttttt 
attttagtgg 
gggtttttat 
agagggaaat 
agggggtgaa 
agatttaaga 
gagaattttt 
ttttgggaag 
ggtgtttttg 
tttattattt 
gtagttagtt 
tgtgttggtg 
ttgtaatgat 
ggtggggaat 
gagtgtagag 
tgttggaaga 
gatttgtgta 
attatttatt 
tttttatttt 
ggtaaagatt 
taaaatgtag 
aaatattttg 
ttaattaata 
tagatggtaa 
ttaatggtgt 
gaaggaggat 
tataaatatt 
gattatgttt 
gagtggtggg 



aattttagtt 
ggtataaggt 
aggttataga 
tgttggtatg 
aataattttt 
ttggaatttt 
tattatatgt 
attttgagaa 
tagatgtttt 
aaatattgaa 
ttttagtttt 
gagttaat'tg 
atttagtttt 
tttatattgg 
gttttttttt 
tagtggggtg 
gtgttttagt 
attagtatta 
atatatatga 
ggtgggggtg 
tgtgtggttt 
tttgtttttt 
gaagaatagt 
taataaatgt 
"tgaggggttt 
tgttagtgta 
.agttttggat. 
ataagtgttt 
tttattgagt 
atttttaata 
taa'aataaat 
taggatttaa 
agttaggttt 
agatgtttga 
aatatattta 



tgtaatgttt 

attattagtg 

ggttatttat 

aatggtttgt 

tattttgttg 

ttgtttttta 

ttgatttttg 

tagattgtga 

ttttaa'agaa 

tgattgaatg 

tgttttttgt 

taggaagagt 

gtaattagta 

ggtgttgggg 

ttttaagaga 

attttagttg 

aaggggtgtg 

tttgtttgtt 

ataagtagag 

ggatgagggt 

ttttggaatt 

aggggttatt 

taatttgata 

tttttgggtt 

ttagagatag 

ataatttttg 

ttttattgtt 

ttaaatgatt 

attttttgag 

atggtgtagt 

agattttatt 

agtggtgatt 

ggggagaaag 

attttgtgtt 

tattatggaa 



aatatataat 

gagaattaaa 

tggtgagaaa' 

tttaattttt 

tttttgaaat 

gttttttaga 

tgtgtatttg 

tttgttaagt 

ggttttttta 

attttttagg 

ggttttttag 

aataattgat 

gtttagtttg 

agaagtgggt 

tgtggtgttt 

tggttgtgtg 

gtttttaatt 

gaattttagt 

ttgttggatg 

tagtgatttg 

ttgggaaatg 

tttgtggttg 

ttttgtgttt 

tagttgttga 

tttttgggaa 

tatttttata 

tattttttaa 

ttgtgtttgt 

ttttgatagg 

ttagatttgt 

tttgttgggg 

tttgtaaaga 

agggtaaagt 

gaaaatggtt 

a 



ttaaataatt 

ggtgtataat 

ttaatgtaaa 

tgtattttaa 

ataggtaatt 

tgtgtttatt 

agtttataat 

atataaattt 

tgaatttaaa 

agttgatatt 

ttttttaaaa 

gtttatagat 

gttttttttt" 

tgtatttgtt 

taagaataat 

tgtatgttgg 

tttgtatgtt 

ggtgagttta 

tt'gattgtgg 

ttgatttttg 

tagttaagag 

gaagtaatgg 

atttatttat ' 

tttagagaaa 

ggaaagtttt 

tgtaaatatt 

aatagaattt 

atgttgagaa 

atttttaagt 

gattaatgtg 

agaaagagga 

aagaag'ggta 

tataaaagta 

ttaagtagga' 



<210> 20. 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 20 

aaagatgatt 
agggtatttt 
ggttataatt. 
ttttttttta 
gagaaatttt 
ttataatgtg 

• ttatttttgt 
attttttgtt 
aagaggttat 
ttttagtata 
ttgtaagtgt 
atgattaggt 

' gtgatttttg 
attgtagatt 
tggtagagtt 
gtttgtttta 
gtttaagtat 



aaaagtttaa 
tagtagtttt 
tttgtgtttt 
ttttgtttta 
aaggtttata 
ttgggtattt 
ggattttttt 
tattgattag 
gttatatgta 
ttttttttta 
ttatttataa 
ttgaaattaa 
tttgtaggat 
ttagttgttg 
ttttaattgt 
ttttgtgtaa 
tgtttaagtg 



ttgtttattt 
tgtttatttt. 
agtaaatttg 
ttattttgat 
ttatttaaat 
ttagatttgt 
tttatttttg 
aggaggggtt 
tatttttata 
ttaattttaa 
attgtgaaat 
ta1:ataagat 
tttttttttt 
* tagaattaat 
tgaattttaa 
tggattttat 
taaattattt 



gaagagttga 
gtttatttgg 
tgtatgttta 
gtaaaattat 
tttgttagat" 
ttattaaaaa 
agatgttttt 
ttaattatgg 
atataattta 
taatattatt 
gaattaaaat 
ttaatttttt 
ttttagatgt 
tagatttaag 
tattgttgtg 
attatagagg 
aatatttttt 



tttttttatt 
ttttttttgt 
tttttttttt 
ttgttaattt 
ttttattttt 
gatgtaaaat 
tttggttgta 
gtgaatttta 
tatttatata 
gtaagttatg 
gaaagggtaa 
ttaattaaag 
tattggattg 
atgagttttt 
attagttagt 
tattttttta 
agttattaag 



tttgtaataa 

ggttgtgtaa 

tgttattatt 

tatttgaaat 

gttatatggt 

aaaataatga 

ttattttttt 

tattttattg 

gtatttttat 

ttgaagtaga 

agattaaatt 

atttttgtag 

tattaggttt 

tgattttttt 

gt'tataattt 

atgttaagat ■ 

taattaagat 



480 
540 
600 • 
660 
720 
780 
840 
900 
960 
1020 
1080-. 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
'2340 
2400 
2460 
2501 



60 
120 
180 
240* 
300 
360 
420 
480 
540 
600 * 
660 • 
720 
.780' . 
840 
900 
960 
1020 : 



aggtaggatt 
tttgaatttt 
tttttttatt 
taattgttag 
tttttaggta 
agtataggtt 
tttatttatt 
tttattttgt 
tg'gaggatgt 
ttattaattt 
agtattatta 
aatgggtaaa 
tttgtgagaa 
tttattttga 
tgttttgaga 
gtttttttta 
aaaatttttt 
ttagttgtta 
agaagataat 
aagattgtga 
tgaggtaaga 
ttttgtaaaa 
atataaagaa 
taaagagaag 
tagaaattag 



ttatttgttt 
atttggttaa ' 
tggttatttt 
attttgtttt 
ttttttggtt 
attatagtta 
atttttttat 
taaatgattg 
gtaagattag 
attttggttt 
atataaataa 
atatttttaa 
ggtaaatgta 
attaagattt 
ttaaaaatta 
tttttttttt 
ttaattttat 
aagtaagttg 
tttaattatt 
tttttaattg 
■tattttattt 
gttagtattt 
aataagtgta 
agattaaata 



taaaatgatt 

ttttaatata 

ttgtattttt 

attttttttt 

ttagtttgag 

attttttgaa 

taaaaaggtt 

ttttttaatg 

ttttttatta 

taaaattttg* 

atttttatta 

aattttgaag 

tttagatata 

tattgtttta 

gtatattaat 

taaaaataag 

aggtttaagg 

aggtattgaa 

ttataaaatt 

tagtttattt 

aaaagtgtat 

atatttttaa 

aaattttaat 

tagtattatt 



tgatttaaat 

atttttagta 

tttgtttttt 

gatagttttt 

tatagtagat 

taaatgtgat 

taaagttttt 

atattttaga 

aataaaaagt 

tgattattag 

taattatatt 

aaattattat 

attaaagttt 

tttttttaga 

a'taatttttt 

tagttattaa 

gtagttaagg 

gatggagaat 

gga'aattgag 

atttttattt 

tttaaattaa 

ataagatata 

ggttagttag 

gtattgaggt 



taaaaagaga 

ttttataatt 

tttttttttt 

atttttaagg 

tttaagatta 

tgaattttat 

atttaatgtt 

attttagaat 

gtgaaatgga 

ataaaattta 

ttttaagttt 

ttgatagaaa 

ttttttttat 

tgttgttatt 

ttaaatgttt 

attttttagt 

atggttgtag 

ttaaattttt 

gtatttaata 

agtatttttt 

gtaataatat 

ttgaatttat 

tttttagttt 

aaggtttttt 



atgtggattt 

ttttttaaag 

ttagttataa ' 

ttatttattt 

tatatgttat 

gttagtaatt 

tttttttatg 

tattttatta 

atatgtaatt 

gaaataaaat 

tgtttgtaag 

gtttaattta 

tttaatttta 

tgaataatat 

taagagtttt 

agtgaatttt 

ttttatatga 

gataagagtt 

tgaaggtatt 

tttgtaaatt' 

gtaaattttt 

ttagtgaatt 

tttttaagat 

gtgtagttta 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 " 
1800 
1860 
1920 - 
1980 
2040 • 
2100 
2160 
2220 
* 2280 • 
2340 
2400 , 
•2460 
2470 



<210> 21 
<211> 2470 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> chemically treated genomic DNA (Homo' sapiens) 



<400> 21 



ttagttttta 
ttttttttta 
ttttttatat 
ttttgtaaga 
ttttatttta 
ttatagtttt 
attatttttt. 
taataattga 
aaaggatttt 
taaaggaaat 
ttttaaaata 
ttaagataaa 
tttttataga 
tttatttatt 
tagtaatatt 
ggattaatga 
atatttttta 
ataaaatgga 
ggtaaataag 
agtttatgtt 
tatttaaaga 
ttggtagtta 
ggtaagagga 
aagatttagg 
aattttgttt 
atatttaaat 
taaggtagat 



tgaattatat 
attttaaaaa 
gatttattga 
aagagtttat 
agtttataag 
gatattttta 
agtttttgtt 
ttatatgaag 
gaaatttatt 
aaaattttta 
atattgttta 
tgaaattaaa- 
tggattaaat 
tttataggta 
attttatttt 
gattatatat 
tggtgaaata 
tatgaaggag* 
aattattaat 
atggtatatg 
gaatggatga 
ttatgattgg 
ttttaggaag 
agatttatat 
attttaatta 
attttgatat 
aggttgtaat 



agaaaatttt 
gaattaagaa 
atgaatttaa 
atattattgt 
agggaatatt 
tattagatgt 
gaagatttga 
ttgtagttat 
attaaaaaat 
agatgtttaa 
agtagtagta 
atagaagaga 
tttttattaa 
aaatttagga 
tgaattttat 
tttattttat 
gttttgaagt 
♦ agtattaaat 
ataaagttta 
fcggttttggg 
ttttaggagt 
gaaagaagag 
aattatagaa 
ttttttttta 
tttaatagtt 
taaaaaaatg 
-attggttagt 



gttttagtat 
ttaattagtt 
tatattttat 
ttaatttaaa 
gaataaaaat 
tttagttttt 
gttttttatt 
ttttggttat 
ttagtggttg 
gaagaattgt 
tttaagagga 
gaattttagt 
gtaataatt't 
aatataatta 
ttgatagtta 
attttttatt 
tttaagatgt 
gaagatttta 
attatattta 
atttgttgtg 
agagattgtt 
gagagataaa 
tgttaggagt 
gtttaggtta 
aaaaagtatt 
tttttgtaat 
tat'gataata 



agtgatgtta 
attggagttt 
ttaaaaatat 
atatattttt 
ggataaatta 
agttttgtaa 
"tttagtgttt 
tt'ttggattt 
tttgttttta 
gttaatatgt 
tgaaatagtg 
t'gtgtttgaa 
ttttaaggtt 
tgataaaaat 
tagaatttta 
tgataaaagg 
tattaaaagg 
agtttttttg 
tttaaaaggt 
tttaaattga 
aaagagaaat 
gaagatataa 
tatattaaga 
aattattttg 
aagtagtttg 
atgaaattta 
ttggagttta 



tatttggttt 
tatatttatf 
aaatgttaat 
aagtaaagtg 
taattaaaag 
gatgattgga 
taatttgttt. 
ataaggttaa 
aagaaagggg 
tagtttttgg 
gagttttagt 
tatatttgtt 
ttaaagatat 
ttatttatat . 
gagttagaat . 
ttaattttat 
taattattta 
ataggaagat 
tgattataat 
ggttaaaaga 
gaagtagagt 
aagatagtta 
ttaattaagt" 
gaataaataa 
tatttaagta 
ttatataaaa 
gtaattggaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 ' 
840. 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



• ••• • 



gattttatta 
aatttatagt 
tagggattat 
gtttggttat 
atatttataa 
tatgttaaaa 
ataatttttt 
ggtaaggggt 
ataggagtga 
tatattataa 
aaaatttttt 
taagagaaaa 
agttataatt 
aaagtatttt 
aattattttt 



aaggaaatta 
aaatttggta 
ttataaaagt 
gatttaattt 
tttgttttaa 
ataaaagtat 
tagtaagata 
gaagaagtaa 
ttattatttt 
attatatgat 
attttaaata 
ggtagtaata 
ttatataatt 
ttattatagg 



ggggatttat 

taatttaatg 

ttttggttga 

ttgttttttt 

tataatttat 

tatgtaaatg 

tagggtttat 

tgtagttaaa 

gttttatatt 

agaggtaggg 

aaattaatag 

gagaaaaaaa 

ataaaaagag 

aataaaaaaa 



tttagattta 
atatttgagg 
aaaaaattga 
attttaattt 
agtgatatta 
taagttatat 
ttatagttaa 
ggagatattt 
tttttaataa 
atttgataga 
gtaattttat 
tgggtatgta 
.ttggatgggt 
ttaatttttt 



gttagtttta 
aggaagggga 
gttttgtgtg 
attttataat 
ttaggattaa 
tatgaaaata 
gatttttttt 
taaaaataaa 
gtaggtttga 
atttgaataa 
attaaaataa 
taagtttatt 
aagatgagta 
agatgaataa 



taatggttag 
gttttgtagg 
ttaattttag 
ttgtaaatga 
taaaaaaagg 
tatatgtaat 
ttgattaatg 
ggaaaaattt 
aaatatttag 
tgtgaatttt 
taaaataaaa 
gagatataga 
gagattgtta 
ttaaattttt 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2470 




<210> 22 . 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 22. 




aatgtaatgg 
gtgtggaagg 
ttggagagat 
gaatggtgat 
ttggagatat 
tagtaagtta 
gagttgggag 
gtatattttt 
aagagaatgt 
ttttgattgt 
tttttatgtt 
tttttttttt 
tggttattgt 
tttaatattt. 
gtgtttaaat 
ggtattatta 
tataaatttt 
gtgattagag 
gggtgttatg 
gttagtatta 
gttattggga 
attagtttaa 
tatgtatttt 
gggtttggtt' 
ttaaaatatt 
gtttagagat 
tttattgtta 
atttatatat 
tgtttttttt 
gta'ggggtaa 
ttgttttttg. 
tgtgtagtag 
' tgttttttga 
ttatatattg 
tttt'tgattt 
• aggtttgttt 
■ tggttagtgt 



aaaaagagag 
taagggaaaa 
agtgatgttg 
ttatatttga 
atttaatgga 
tggttggaat 
tttgagtgaa 
ttatttatgt 
tggagagaaa 
aagtttttga 
aatagtgaat 
tttttttttg 
tgatttatag 
atttatattt 
attgtaatat 
atatttttat 
ttagtaattg 
ttatttaata 
tgttttggga 
tggttgagag 
aatgagagat 
aataaaggta 
ttatafegttt 
aatgaagtgg 
tattttaaag 
tgggggttag 
aattatattt 
ttttgattgg 
tttgtttggg 
aggggttggg 
tgagtagata 
tttagttgtg 
atttttgtta 
agttatttgt 
tta'taagttt 
tgattttttt 
ttgtatttta 



attgtaaagt 
ttagtttttg 
ggtatagata 
ataagtttgt 
ggagttagat 
gtgtgtgttt 
gtttttgtta 
ttattaagtt 
gtggttaaga 
ataattaaag 
aattgtagat 
aatatttttg 
gaggtattat 
agtatttatt 
tgggggtttt 
ggttttttat 
tatagtgttt 
tttaaggtgg 
ttttgagtgt 
aatagttgag 
tttgttttta 
tttattttta 
atagaaagta 
aggagtatta 
tttggttttt 
ggttagtaag 
ttttggaaaa 
tattttattt 
gtggtagtag 
gttttttttt 
gtaagttttt 
tgtttgttgg 
ttttatgtat 
atatgtgagt 
atggaatatt 
ttttgtagtt 
gtagggagat 



tagaaggttt 
aagaagatag 
tggggaagtt 
aatgtttagt 
taatttttat 
agtaggagag 
aaggtagaag 
tagggataag 
aaattgtttt 
tttgtgagga 
tttttttttt 
tttttttttg 
tgttattaat 
tttaatattg 
ttaaagtata 
tttttttttg 
tagggttaga 
tgattaatgt 
gggtatttta 
aaagtggtta 
attatggtt^ 
ttttatgttt 
gttaattgag 
tattttagtt 
tggtgtaatt 
gtatttgatt 
attatttatg 
tagttgtaag 
aaagtatagg 
tttagtatag 
ttttattttt 
gaggggttgt 
aaatatattt 
atattttttt 
tttggaaaga 
ttagtatttt 
gaggattgtt 



aggaattgtt 
tgagatttta 
gagaggaata 
agattgttgg 
ttttttttat 
ggtagggagg 
aggaaagttg 
gtttattaag 
tattgaattt 
gatagttaat • 
tttttttttt 
ggattggttt 
aaagggtaat 
attatggaga 
aaaatatata 
tattggtttt 
gattggttat 
ttggtaataa 
ggagtatttt 
agaggtggat 
gtgtaatttg 
atattttagg 
ttatttatgg 
ggaaatatat 
ggaggtatgg 
tatatgtatt 
ttttattttg 
attatgattt 
gtatttttta 
.ttttttttgg 
tattgttatt 
taagtgtttt 
atatattttt 
tttttttttt 
tgtttttgat 
gagaaagtaa 
gttttttatg 



ttttgattag 
atttgggtgg 
ttatgtttga 
aaaagtggg'g 
ttgagagagt 
gaagttaaga' 
gtgtagtata . 
atgagttt'gg 
tttgggttaa 
ttttttattt 
ttttttttgt 
agagtatggg 
agtttttttt 
gagttttttt 
±ttgtatgat 
aagagttatt • 
ttttggtatt 
agtttttatt 
agtattgtgt 
ttatgtgaat 
aaagtttaaa 
tttttaataa 
aaaggtttgt 
ttttagaatg 
taatgttttt 
ttagaaggtt 
taaatttgat 
atagtaagtt 
gtttttaagg 
ttgtg'ttata 
tatttagtgt 
gtttattggt 
tttgtttagt- 
' a'tttttttgg 
ttagtagggt ■ 
tttatttttt 
ggggtatgtg 



60 
120 
180 
240 

^300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 

. 1200 
1260 
1320 
1380 

" 1440 
1500 
1560 
1620 
1680 
1740 
1800. 
1860 
1920 
1980 
2040 
210Q 

• 2160 
2220 



tgtgtttttt 
taggtttata 
ttgaaaattg 
gtaatgattg 
ttatttgttg 
gtatattgtt 
aaatagttaa 
attttgttag 
gtattagagt 
attatag.atg 
tatatatata 
aatttagtta 
tagtttaagt 
attttaaaaa 
tttgtgattt 
. gttttggttg 
gtttttttga 
ttaatataaa 
attttgggag 
ggtaaaatgt 
ttttttgtta 
tgtatatttt 
tatgtatata 
aatagagata 
tttttggatt 
gatttttgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgta 
gttggtggag 
tggatttgta 
gttgtgttgt 
ttttttgttt 
ttttttatat 
agtttttgaa 
tggatagtag 
ttgtggttgt 
ttgaggttgt 
• ttttgagttt 
atggttagta 
ttggtttgtt 
gtttggtagg* 
tgtgggatgt 
ggttttgtgt 
aaatgtattt 
ttaataaaat 
tttaggtaaa 
aaagttgggg 
ttttgtttaa 
agttatgatg 
tttatgtttt 
ttggttgagt 
taggtgtttt 
ttgttagttt 
ttagttaggg 
gagattattt 
ataagaattg 
tttgtttttt 
gtgaatattt 
agggtgttat 
ttggtgtttt 
tttttaggga 
attgaagggt 



ttttttttta 
ttttttaaga 
aaagtgtgat 
ttggttataa 
ataatttata 
taataatggt 
aaatgtaatt 
atattttgtt. 
ttattaggtt 
tatgatatat 
tatatatata 
aaaatttttt 
gaattgagaa 
tttgttagtt 
gaggttatgt 
tataattaat 
attatttttt 
tatataagtt 
tattttgtag 
agtttttttt 
gtttttttag 
attttatttg 
ttatgtattt 
tattggagtt 
tgttttttgt 
tgtttttgtt 
aagtttaggg 
taagttaatg 
tatgagtttg 
ggtgtttgtt 
ggtttttggg 
ttttaatttt 
tgtggggata 
taaagtattt 
ttgtttgtag 
taagtttgtt 
tgttaatgtg 
ggtgtttggt 
gttgatgtta 
ggtgttttat 
ggtattttat 
agggagggag 
gtgatttgag 
gtagtttgtg 
tttatttatt 
atatgttgga 
taaatatggg 
gtgtttggag 
tgtgtttttt 
atgatttttt 
ggggtttta ; g 
ttgtgttgga 
gtgagtttaa 
agtgataagt 
tttgtgtttt 
tgtgatttta 
tttttttttt 
gttttttgtt 
ttttaggtgg 
atggttgggt 
tatatttttt- 
tatttatagt 
agattttggg 



ggatttgtag 
gttttatgaa 
ttatggaaat 
agggaaatat 
gttattaatg 
ttatttttgt 
tttagtaata 
gggtgtttgg 
gtagtaaagg 
tttatttatt 
tatatgtgtg 
tttatttgtt 
gattgagttg 
ggattagatt 
ttggtatgaa 
ttgtggaagg 
atatgagaat 
aggttgagag 
aggaagaaat . 
atatgtatat 
ttgttaaata 
ttttattttg 
ttagttaatg 
tggtatgggg 
gtttatttta 
ttttggttgt 
aagttgtttt 
ttagggtaag 
ggagattagt 
ttgggattgt 
gtagggttgg 
gggttgtgtt 
tggtttgtat 
gggatggttt 
tttaagattt 
gtgtataatt 
taggtttatg 
tttaatggtt 
ttgtatttgt 
tatttggaga 
aggtatttgt 
ggagggaggg 
ggtgtgtgta 
ttgtgtttag 
tattgtttgt 
aaattagata 
gtgttttgag 
ttagtagagt 
aaattttttt 
gggaagtgat 
agagtgattg 
gtggattgtt 
aataagttat 
ttgttttatt 
gtagttattg 
atggtgaagg 
tttttttttt 
tttttgatgg 
ttataggata 
tttgaattag 
tgttttgtaa 
agaaggaagg 
ggtatttagt 



gattttttgt 
gtgttttttg 
taaattattt 
ttgtgatgta 
ttaaatttga 
aagaattaga 
gtaagtttgg 
gatagtaatt 
aagtttattt 
ttttgttttt 
tgtgtatgtg 
ttttatttgg 
taggattaaa 
gataatgtaa 
aaggttatat 
tatgaatatt 
ttttaatggg 
aattttagaa 
tgaggttttg 
tttgaatttt 
tagttgtttg 
gttatagtgt 
aggaggggga 
tatataaggt 
agtttagttt 
"gaaatttagt 
ttgggattgt 
gtaatagttt 
atttaaagtt 
-atttgttttt 
ggttagagtt 
ttttttttag 
tttgtttgtg 
tattgtatta 
ttttggagtg 
attttgaggg 
gttagattgg 
tggggggttt 
tgttgtagtt 
atgagtttag 
gtttgtgttg 
agaagggaga 
gggagtttgg 
agttaagttt 
gtgagaggta 
aagtagtagt 
ttatttggga 
ttagtagagt 
ttatttaga't 
ttttgttatt 
ggagttgaat 
ggtatgtgat 
atggaagtat 
tggggagaat 
tggaaggagt 
ttgtgtgttt 
ttttttattt 
gtttgtagag 
ggtgtatttt 
gtatttttta 
gttgtggatt 
ggattaaagt 
gtgtattttt 



gttatttgta 
tatgtgtttt 
gtaaaaaatt 
tttaatgtgt 
ttttggtttt 
taaaatgtat 
tatttagata 
aaaataaagt 
aaaagtataa 
ttaatatgta 
tgtgtgtatg 
atatttgatt 
ggatagatat 
tataattgtt 
tttatattta 
tatgtgtgtt 
attaagtata 
ggttgtggaa 
gtaggttgta 
tgtttttttt 
tggttggttg 
agtttttttt 
attaaataga 
agtatattag 
tttttgggtt 
ttttatttag 
tttaaattga 
ttggttgttt 
ggaggtttgg 
gttgggttgt 
tgtgtgttgg 
gtggtttgtt 
gttatggatt 
gatttaaggg 
gtttttgggt 
tgttgtttat 
ttttttttat 
ttttttattt 
gttgtttttt 
tggttatatg 
tttgttgggg 
gtttagggag 
ggtgtgtggt 
ttttgttggg 
gatttgaaag 
tatttgtggg 
aggttttgtt 
tttatttatt 
tatttgattg 
tgtttttttt 
gggtttgatt 
ttttgatagt 
aagtgtttaa 
gttttggagt 
gtggttggtt 
ttattttaat 
ttttttgttt 
ggattaggtg 
ggattgggtt 
attgtatatt 
agtttttgtt 
gtag'tttggt 
agttgtt'ttg 



tataatttgg 
aaaaaggtat 
gttttggaaa 
ttttaatttt 
agttatattt 
atttgatata 
gattatgaat 
attgatagtt 
attatttaag 
tatatatata 
tttaattttt 
ttgtatattt 
gtagaaatgt 
aaagttttgg 
gttttttgaa 
ttaattaaag 
gtattgtggt 
gggtttattt 
tttttttgat 
tttttagatg 
tgtatgtaat 
agggttattt . 
aagagagata 
agaaagttgg 
atttttagta 
tagtgatgat 
gttgtgtttg 
tttagtattt 
gagtttagga 
ttggttttat 
tgggatatgt 
ggtttttgag 
atgattatga 
aatgagttgg 
gaggtgtatt 
gagtttaatg 
ggttttgggt 
aatagtgtgt 
ttgtagtttt 
gtgtgtgagg 
tggttgttgt 
ttgtgggagt 
ttagtttggg 
tagttgaaaa 
tttgggtttt 
ggaaaatatt 
tttggtattt 
tttttaatgt 
gaaatatgtt 
ttttttttat 
ttggagttag 
tggaaatttg 
aaataatttt 
ggtgtgtggg* 
taggatatag 
tttttttttt 
agtttttttt 
ggtgtttttg 
tggaagtttt 
ggtatttgga 
tagtattttg 
tttagaggat 
gagaaattta 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 . . 
2940 ' 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 • 
3660 
3720 
3780 • 
3840 
3900 
3960 
4020 . . 
4080 
4140 
4200 
4260 
4320 
4380 
. 4440 
4500 
4560 . 
4620 
4680 • 
4740 
4800. 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400. 
5460 
5520 
5580 
5640 
5700 
5760 
5820" 
5880 
.5940 
6000 



gagtatttta 

aatttttaaa 

agagaaaaag 

gattaagttt 

gatgtgaatt 

ttagtttttt 

tttggtagat 

gggattgtat 

tgttttattt 

agagataagg 

tttagtttta 

agaattttgt 

gatagatatt 

aaaggaaagg 

agttttgggg 

ttagtttagt 

aaatattaga 



tagttatgta 

ggttggtata 

gatttttttt 

aattttatat 

ttataatttt 

tattgtaaat 

tatttattgt 

tttatttaaa 

tatatttttt 

aatttaaaaa 

gaatgatttt 

ttgttattat 

atttagtatt 

attgattggg 

ttgaaagtag 

gagtttagaa 

gtgtatggta 



gatttaagtt 

ttttaaatta 

gtttattttt 

aatatttaaa 

atgatggatt 

taatgttgtt 

tttgttaata 

attgtttatt 

gtattatatg 

ttaaaaaaaa 

ttttgtgttt 

ttagtaagga 

tttttttttt 

tttttttagg 

tgtttttaaa 

tattttttaa 

ttttgtaagg 



gtttttattt 

attttatttg 

ttttttttaa 

attgtttgat 

ttagaaatta 

gaattttaat 

attataaagt 

attttagtga 

atatttaaat 

aatttgtata 

tattttttgg 

aaagaagtat 

tagttttttt 

aaattttaag 

ttgtttgtta 

aaggattaaa 

ataagttttg 



aaaagataaa 

ttttaattta 

atggaagaat 

gtgaaggaag 

tttttttttt 

gggtattaat 

gaatttggtt 

taagtggtat 

atttttagaa 

aatgggattt 

attatttttt 

ttatgtaagt 

gtttaaatga 

ttttttaagt 

tgatttagag 

atggaaagga 

t 



taatgaataa 

gggttaaaat 

aaagtatagt 

gtattggtat 

tatttaattt 

gagattgttt 

aaatatagag 

tagtgtaata 

taataaaaaa 

tgtgtggaaa 

tttttttgtt 

tttttatatg 

ttttgggtat 

agtttttaaa 

ggttatgaat 

atataataga 



6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600' 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



<210> 23 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 23 



ataaaattta 
ttttagtttt 
atgataagta 
atttaaagtt 
tagaaaaatt 
gatgtttttt 
tttgggaaat 
ttgtgtaagt 
tgtttgaata 
attattggga 
tattttgtaa 
tattgaaatt 
atggtttttg 
tattaagtag 
tttaaaaaaa 
ataagtaaaa 
tgagtaaaga 
tgaagatgta 
tattttagtt 
tggtttatgg 
ttgggaaatg 
tggggtatat 
tttttgtagg 
aaaatgggag 
aggatatata 
♦tgtttttttt 
tatttttttt 
tgtgttttta 
agttatatgt 
tatttaattt 
gggtgatagg 
agtttagatg 
. aattttgttg 
tttttaagtg 
"aattgttgtt 



tttttatgaa 
tttaaaaaat 
gtttgaaaat 
ttttaaaaga 
gagaggggaa 
ttttttgttg 
aaaatatagg 
ttttttttta 
ttatgtaatg 
tgatgaataa 
ttattagtaa 
tagtagtatt 
gagtttatta 
ttttaaatgt 
aaaaggtggg 
ttaatttgaa 
tagtttggat 
tattgaatat 
tttttttttt 
tttataaggt 
tttaatttag 
ttgttttgtg 
tttattagga 
ggagaggaga 
atttttgtta 
atagtggttg 
gggtgggata 
tatggtttgt 
tagtaatttg 
ttaattgttt 
aattgttttt 
aaaggaaatt 
agttttgtta 
atttaaagtg 
ttgtttggtt 



atattatgta 
attttagatt 
attgttttta 
gtttaattaa 
aaatattaaa 
gataataata 
aaaaattatt 
atttttaagt 
tagaaagtgt 
ttttgaataa 
ggtagtaaat 
aatttgtaat 
taaggttatg 
tgtgtggaat 
tagaagaaat 
atatgttaat 
ttgtgtggtt 
ttttagaatt 
tgttataaat 
agaaagatat 
gatttagtta 
gttatttagg 
aaataaaaaa 
gggagaaaga 
ttgaagttat 
tgaggtgtag 
ggtttgttat 
tttaaatttg 
ttttagtgtg 
tttaaagttt 
tagaaagtta 
tggggagtat 
attttaaata 
ttttgtgttt 
ttttaatgtg 



ttttgatatt 
tattaaattg 
gttttaaaat 
tttttttttt 
taatatttgt 
ggtagaattt 
ttaaaattga 
tttttgtttt 
aagatagggt 
gatgtgattt 
aatttattaa 
aaaagaattg 
gaatttatat 
tagatttaat 
tttttttttt 
ttttaaaagt 
gtgggatgtt 
tgttttttag 
atttttggaa 
agggatatta 
tgtggtgttt 
aaggtgttta 
taaaaaataa 
ataatttttg 
aaagtggttt 
attttggttg 
tgggttggta 
tgggatattt 
gatttagtta 
taggatgtgg 
ttattatagt 
attaaataaa 
tttttaattt 
atttgtttgg 
tgttttgtta 



ttttattata 
agtttatgat 
ttttgagaat 
tatatttaga 
ttatatgaag 
taataaaaga 
atttttatat 
tttttttatt 
atattatatt 
tttttgtatt 
ggagtagttt 
aaaattaaat 
tatattagtg 
tgttgtattt 
tgttttaatt 
tttgtttatt 
ttaaattttt 
tattttttgg 
atagaatatt 
gtttggatat 
tgaagttttt 
ttagaagtgt 
aaggagaaat 
tagggaaaaa 
tttgtgtttt 
atttgtatgt 
ggagattatt 
ataaattttt 
gttaattttg 
ggtggggagg 
tgatatattt 
aatattaaaa 
taaatgttaa 
aggtgttttt 
ggaagtttgg 



ttttttttta 
tttttgggtt 
tatttaagaa 
gttatttaag 
aatttgtata 
ggaaagataa 
agagttttat 
attttaagag . 
gatattattt 
tgattaggtt 
tattagtgtt 
agggaagaaa 
ttttttttta . 
tgttttttta 
ttaaattaaa 
gtttgttttt 
ttaaggtggt 
ggttaaattg 
gaataaaaat. 
tagtgtatag 
aaatttagtt 
ttatttaatt 
tgggtaagag 
aattaaaatg 
ggattggttg 
tattttgggg 
tttaagtatt 
ggttgttaga 
aaattagatt 
aggggaaagt 
ttaattaaat 
ggataaataa 
gagtgagatt 
ttttataaat 
gtttttgggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600' 
660 
720 
780 
840 
900 . 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500' 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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ttgttttttg 
agaatttgat 
tttgggtttt 
tttttttttt 
gtggtgtggg 
gttgggtttg 
tagttgtggt 
gaaatttttt 
gttggtttga 
gtttttgggg 
ttgttttagg 
ttgatgtagt 
ttgtgggtag 
tttggaaaaa 
gagttttggt 
tgggagtaag 
taattttaag 
gggattgttg 
agtgatttta 
ggttgagttt 
aagattgggt 
tgttttatgt 
tttttttttt 
tatattgtaa 
atagatagtt 
ggagatttag 
ttaggatttt 
tttttgagat 
ttttattagg 
gggtgtttat 
aatgtgattt 
gttatattgt 
ttttagtttt 
tttagatgaa 
gtatatatat 
aggagataaa 
tgaataaatt 
taattgttat 
gttaagttta 
' atttaatttt 




attgagttta 
gtgtattgta 
tagatgattt 
gtaaaaagta 
tataaagaat 
tagtaatttt 
taaaatgtta 
gtttttttag 
ggaaggaatg 
tggatgtgtt 
ggtagttttt 
ggggagtgag 
gttgtattgg 
ttttatgttt 
ttttatggtt 
atatgggtgg 
ggattaaatg 
tgattgtatt 
tagttgaaat 
gtttagttga 
tgaatataag 
ttgattgtga 
ttgattattt 



tatggatggt 
tttgaatgta 
ttgtgtgtat 
tttttttttt 
tgtgggtatt 
ttttttaggt 
ggtgggtgta 
aggttgttgg 
ttgtagattt 
tagttgtata 
gggattttga 
agggttattt 
ggtgtagatt 
gagtatagtt 
tttggttttg 
tgtagtttta 
tattggtttt 
ttttgttttg 
aagagtagtt 
tatggttagg 
ttaaaataaa 
gttttgtgtt 
ttgttggtta 
ttgagatagg 
atatttagta 
ggtatatata 
agtttttttt 
ttttttagtt 
aatttttatg 
gttttttata 
ttttatatta 
tggtttggtt 
ataatttgat 
aaataaatag 
atatatatat' 
aaataggtga 
ttttttgttg 
tttaaatatt 
ttgttattaa 
tataaaaata 
gtattaatga 
gatatttttt 
aatttttata 
ttttataggg 
tttataagtt 
tatttttttg 
aagttataga 
aaatgtttta 
tgtttgtatg 
tgtgtgtgga 
tttggtagat 
gggagatttg 
ggaggagaaa 
tttgttgtta 
aaaatagaat 
tttttttgaa 
ttttattggt 
aggaaattag 
gtaatatttt 
ttgttttttg 
ataaaggtag 
ttgggaatga 
ttttaattat 



aagtgggtgg 
gtgtgggttg 
ttttgggttg 
tttttttttt 
tgtagaatgt 
agtagggtat 
gtagtattag 
agttgaatgt 
gtgtgttggt 
tggtgggttt 
gttgtggatg 
tagatgtttt 
gtgtttttgt 
tgaggttaga 
ttttgggagt 
ggatgaatgt 
ttgagtttat 
atattggttt 
tttttgaatt 
gggtgggggt 
tgtgaaagat 
agattttgat 
gaaatatgta 
gtagatagaa 
gttgggggaa 
taggaagagt 
tttgtaaaat 
tgatttgtgt 
tgaaggatga 
ggttggttat 
aatataattt 
tagttaatag 
ttttttggtt 
aaaaaagttt 
atgtgtatat 
agtatattat 
tagtttaata 
taatagagta 
gagfctatatt 
gattattgtt 
ttatagattg. 
tttatggtta 
aattatattt 
tttttaaaaa 
ttgaaagaaa 
ttaggatata 
gagagaaatt 
tgggtttgta 
tgtgagtggt 
atggtaggga 
atgtagttgg 
ttgtttgttt 
ttttagtttt 
ttttaagtaa 
gttagttaaa 
agaatataat 
tttagttttt 
attttggaat 
tttattttgt 
taaatatgtg 
atatttttgt 
ggttttttat 
ttttttggtt 



agagtatgtt 
tatgtagaat 
tgtgttttgt 
ttttgttggg 
tggtgggttg 
ttgttggttg 
tgggtttgga 
tgtagtttta 
ggtggttgtg 
gttgttgttt 
gtttaggggt 
ggtgtggagg 
agggtagaag 
ggtgatgtag 
ttgtgggttt 
tttttgttag 
atgtattata 
aaatattatt 
ttattttatt 
gtatgaggat 
ggatttaggg 
atatttttgt 
gtgtgtatat 
tggggtgtgt 
ttgatagggg 
tgtattttgt 
gtttttaaag 
gtttatgttg 
tttagaaaaa 
gtaattaaaa 
taggttatga 
atttttaaaa 
tatttgggtt 
ttaattgaat 
atatatatat 
gtgtttataa 
gattttgata 
tttgatgaag 
tttgattatt 
ggataatatg 
ttagtaaata 
gtaattatta 
ttaattttta 
atgtgaattt 
aaggagatat 
gatattagtt 
aaaataagtt 
gaagttaagg- 
ttagtgtgt'g 
tttgggaagt 
gttattgtat 
atagggagta 
tttgttttta 
gaggaaaaat 
agtgtatgga 
ttaataataa 
agtttttgaa 
aaatgttttg 
taattaaatt 
aaaaatatgt 
tttaaattga 
tttttagtgg 
atagtattaa 



ttttttagtt 
ttttgggttg 
ggtttttgta 
tgtggtggtt 
gttttgtgta 
tggggttgta 
gatatgttgt 
gatttggggt 
gtgttgaatt 
aggtatattt 
tttagtttgt 
gttatggtta 
gtttagaaat 
tgtatgtttt 
ggtgaagttg 
tttttgggtt 
aaggtgttgg 
ttaggtataa 
tgttgttgtt 
ttgttaaagg 
gttggttttt 
ttgttttttt 
aggatga'ttt 
ggttgtatat 
gtatttgagg 
tattaggaga 
tagatagatt 
gattatagta 
tttttggtta 
ttttagaaaa 
attaaagttt 
tgtatttttg- 
aggatatgta 
taaaagttaa 
atatatatat 
ttttggatag 
taattattaa 
tgtttatggt 
ttatattaag 
taaatgtagt 
aagggttaaa 
ttttttaaag 
aatgtttttt 
gttaaattat 
atatatattt 
agaaaggtaa 
tattttgttg 
gttgagagag 
aattaggtag 
agttagtagg 
agtgttggat 
gtgtggtata 
tttttggagg 
aggtttgttg 
tattaagttt 
aagttttttg 
tagaggtatt 
gtattttagg 
tataaatttt 
attattaaaa 
ttttaggttt 
tgtt'tatatg 
tatgtaatat 



gtttggtgag 
ggttgtgtgt 
gttttttagg 
attttgatgg 
ttgtgtagtt 
ggaaaggtga 
tgagtggggg 
tgtaggggag ■ 
tgtaggtggt 
tgtttagggg 
ttttttggat 
tggtttgtgg 
tggtgggtta 
gttgatatgt 
ggtgatttga 
tttgggtttt 
aggatggtta 
tttgatttgg 
gttggataga 
tggtttaggg 
tttaatgtgt 
ttttgtttga 
tggggaggat 
gtagttagtt 
ggaagggggt 
atgtaatttg 
tttttataat 
ttgtatttgg 
gggtgtatat 
ttgaatataa 
tggtaattat 
tatgtttatt 
gaattaaata 
atatgtatat 
atatatatta 
tttatatttt 
tattttgttt 
ttatttaaat 
tatatatttt 
tgaagttaaa 
aatatattag 
taatttttta 
taaaatatat 



atgtaaatgg 

ttatggagaa 

gttgtttttt 

gattaagaat 

tgagaaggaa 

agagagtgtg 

tagggtattt 

gaatggtagt 

gttagagaaa 

ttggaaagta 

tgaattatag 

ataaaatagg 

ggatatatgt 

gttatgtttt 

gatgtgtttt 

tttatgaata 

gtttaggata 

ttgagttgta 

gatttatttt 

tgaggtgttt 



2160 

2220 

2280 

2340 

2400 

2460 • 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940' 

3000 

3060 

3120 

3180 

3240 

3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 ' 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
. 5700 
5760 
5820 
5880- 



ttagagtgtt 

gatattagtt 

ttttggtttt 

ataaaaaaga 

ttgtgttttg 

ttagtattga 

tgttggtgat 

ttagaaaaaa 

aaggaaaggg 

ttttttatag 

aaaggtagtt 

ttttgttttt 

ttttttgttt 

ttttttttgt 

ggtaaagatt 

tgttgggtat 

taattttttt 

attgtttttt 

taagtttttt 



tatgtttagg 

attattttgg 

gggatattat 

aatagaaagt 

aaagattttt 

aaataaatat 

agtggtgttt 

gtaagagtat 

gatttgtaat 

gttttgattg 

tttttaatta 

gggtttggtg 

ttggtaaaaa 

tggatatata 

gatttggttt 

tgtaggtttg 

gtgtttgtgt 

ttgagggttg 

agttttataa 



gttttaggat 

atattaaatg 

gtagttattg 

tataaaaatg 

agtattgtag 

tggatataaa 

tttgtaggtt 

ttagggaggg 

tgtttattat 

tttagagatt 

tttttttttt 

agtatgggtg 

ttttatttgg 

tgttttagtt 

ttttgttgaa 

tttaaatatg 

ttagtattat 

attttttttt 

tttttttttt 



atatgatatt 

attttaatta 

agagatttat 

ttaatgatgt 

tgtttgagta 

taaatattga 

aataatggtt 

aggagagagg 

tgatatagga 

tataattaaa 

agtatttttt 

ggaaagtata 

gtttttagtt 

gtgatttatt 

tgtgttttta 

agttattatt 

tgttttttta 

gttttttata 

tttattgtat 



taatggaggt 

taatgttagg 

gagtggtttt 

tattatgtaa 

taggagagtt' 

aaagaaagat 

atttatgttt 

aataggggaa 

agaataagaa 

gttagtttaa 

tttaaattta 

ttgtgttatg" 

tttttggttt 

ggttttttta 

gttttatttt 

tttaaatatg 

gttatttaga 

tttaattaag 

t 



tttgttgtta 

agtggttggt 

tgagattagt 

atatatgttt 

ttttttatag 

tgttattttt 

tagattagtt 

aggagaagga 

ggttagttgt 

gaagtttagt 

ttttggtgag 

ttgatttttt 

ttttttttat 

ggtgaagaag 

tttagtggtt 

gtgttttttt 

ttaaaatttt 

aggtaatttt 



<210> .24 
<211> HOOl 
<212> DNA 
- <Z13> Artificial Sequence 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 24 



ttaagttaga 
attttatagg 
ttttttatgt 
taaagtatta 
gtttggaagt 
ttgtttttta 
ttagtgataa 
tttttaggat 
gttgtttatt 
tagggttagt 
. tagggtgggg 
ttgtgggggt 
tttgtgtgtg 
tggtgtttga 
tttttgttgg 
gtgtaagaat 
ggtgagatgt 
- tttaagataa 
tttttatgtt 
ttgggtattg 
• ttattttata 
tagggttttt 
tttgtttttg 
ttttgtatgt 
gtttagggtt 
ggtatatggt 
taaaattagg 
taggaggatt 
atttttatta 
aattgtttag* 
agttattatt 
ataaaaaata 
ggttgaggtg 



tgttttttaa 
tttaatattt 
tattagatgg 
aggtaatttt 
ttttgtttta 
gtataatttt 
ttaagaaatt 
aaagtttagg 
atttaatagt 
tgtttagttt 
tgggaagttt 
tatttattat 
atagtatatg 
tatttgtgtt 
agggtataga 
tgtttttgtt 
tagtttattt 
ttttagttgt 
ttttttttga 
tattttttga 
ggtttgtttt 
ttgaaaaatg 
atgtatatat 
tttgggtgga 
' ttagtattga 
taggggttag 
ttgggtgtgg 
atttgatttt 
aaaaaagaaa 
gaggttgagg 
gtgttattgt 
aaaatagcftt 
ggtggattat 



ttatttgtgg 
tggagttaga 
atttttgtat 
ttttggtttt 
gttataatgt 
ttttttaagt 
aagaattatg 
gttttgttat 
ttttgtaagt 
ttgtatgttt 
attttggttg 
taagaatttt 
taggtgtttg 
ttgttttggg 
gatagggtgt 
ttaaagaaat 
ggggttaggt 
gattttttga 
gtagttaatt 
ttttttgttt 
gtaagatggg 
gtttttgggg 
atatagttgg 
taagggaaag 
atgtagtttt 
gaatagttga 
tggtttatat 
taggagtttg 
aaaaaaaatt 
tgggaggatt 
attttagttt 
gggtatgttg 
ttgaggttag 



ataggttagg 
aagtttttag 
ttttagaaga 
ttgggataat 
ttatatattt 
tttattgtag 
tatttatgtt 
tgggttttgt 
atattgtgaa 
aggtgtatgt 
tgatgtttag 
tgtagtaggg 
atagtatttt 
ttttgttggg 
attagtttta 
agagaaatta 
tgggtatgta 
gtattaggta 
tattaagttt 
aattttagtt 
ttagtatgga 
tagtttttgg 
tgt.ttatt.tt 
atagaattat 
aaggatatta 
gggattttga 
ttgtaatttt 
agattagttt 
agttaggtgt 
gtttgagttt 
ggggaattga 
gtttatgttt 
gagtttgaga 



tatattt'tga 
gtaaaaagtt 
ttttttatat 
tttaggtttt 
ttggaattgt 
ttatagttta 
tattttttta 
taggagttgt 
ggggaagtaa 
gtatatattt 
tgtgtttaaa 
ttttaatgat 
tgggtaggta 
tattttgtaa 
gttgaatttt 
aattttgata 
taaa'ttattg 
tatagttggg 
atgaagaggt 
tgagaaaggt 
tatagggttt 
aaattgattg 
•gaatttatta 
ttggtttttt 
tagaagtagg 
agagggattt 
agtattttgg 
gggtaatata 
ggtggtgtgt 
gggagattgt 
gtgagatttt 
gtaattttag 
ttagtttgat 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480- 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



gtttaatttt ■ 
tgaagggggt 
taggaaagat 
ggtatgagtg 
tttgtagggt 
ttagttttgt 
gagttatttt. 
agttgtaggg 
tgattagaga 
ttatataggg 
gagtgtaaat 
tttatgtgtt 
aaaggaagtg 
" ggagggtggt 
gatgaagttt, 
ataggtttta 
tttgtgttta 
tatttgtttt 
tgttgttgat 
taggtgtttt 
ttgaggaatt 
tttttggttt 
ttgtttttgg 
tttgttgttt 
ggtaattgaa 
ttatttaaag 
gaggttaagg 
gtaagatttt 
ttgtagtttt 
agttatagta 
gttttaaaat 
tattttggga 
taatatggag 



60 
120 
180 
240 
■300 
360 
420 
480 
540 
600 
. .6.60 . 
720 
780 * 
840 
900 
960 
1020 
1080 
1140 
1200 
1260. 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
" 1800 
1860 
1920 
1980 
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aaattttgtt 
ttagttattt 
tgaattgaga 
aaaaaaaaaa 
tgagttggag 
tggggttatg 
ttattagtgg 
aattattgtg 
ggttttaatt 
ttgtttggag 
tttaggttgt 
agtgattttt 
atttggttaa 
tttgaatttt 
ggtgtgaatt 
tttagggatt 
attgattttt. 
gt-tagatagg 
aatattttgt 
taggagagtt 
tgatggggat 
tttttattta 
gttagggttt 
ttttaagtat 
aattttattg 
tggaaaatga 
tgggaagaag 
agtgttaggg 
gttaatttgt 
tttttttttt 
tttttttttt 
. .tattgtatag 
tgtttgttgg 



tttattaaaa 
aggaggttga 
ttgtgttatt 
aattagtaaa 
tgagggtttg 
atttattgta 
atagagtagt 
atatgatgtt 
tgttagaggt 
atttggttta 
agtgtagtgg 
ttgttttaga 
tttttgtatt 
tgattttaag 
attgtatttg 
ggatttttgt 
tttttttaga 
ttttagttag 
tttagaagaa 
agggtttttt 
aaatggagag 
gaatttgttg 
tataggaaaa 
ttaaaggttt 
gggatagtgg 
gggttggagg 
tatttagatg 
taagaaggaa 
ttttgtgatg 
tggatgattt 
tggtgatgaa 
tttttttagg 
aagaggattt 



atataaaatt 



ggtaggagaa 
gtattttagt 
attatatttt 
ttttggagaa 
gggtgagagg 
atttggagtt 
taattttgat 
atataggtag 
aagatttttt 
ttggattttg 
tttttgagta 
tttagtagag 
tgatttgttt 
atttagagat 
tggaagggtg 
tttttttggt 
aatgtgagtg 
tttttataag 
ttgattttgt 
gatagatgag 
ttattttgta 
aggtaaaggg 
tttggatgga 
gtaattttgt 
ttgtgaataa 
ggaggagtat 
gattattttg 
gttttgagtt 
aggagttttg 
tttttttatt 
attttaaagg 
taatagttat 



agttaggtgt 
ttgtttaaat 
ttgggtaata 
aattgtatat 
ggtagtttat 
agtggagagt 
agttttttat 
ttaagttgta 
ttttttggtg 
ttttttttga 
gtttattgta 
gttgggatta 
atgggatttt 
gtttttattt 
ttttaatttg 
gagtgtggga 
tttattgtat 
ttatagatat 
gttttttgta 
tttggagttt 
ggagtagggt 
gttagtaagg 
ggaggggtgg 
gaagagtatt 
tgggttagta 
agtagtggat 
taggggtagg 
tttgtagaat 
tttatttaat 
ggttgggatg 
tttatttaga 
aggaagagtt 
tttagttgga 



ggtggtatat 
ttgggaggtg 
agagtgaaat 
tttgattata 
tttttttttt 
ggtgtatatt 
gttttatata 
taaaaggtag 
ggtttttgta 
gatgaagttt 
agttttgttt 
taggtgtgtg 
attatattgg 
tttaaagtgt 
attatttatt 
tagggtagtt 
tagttttatt 
agttaagttt 
gaatgatttt 
ttttaagtat 
ggagtgtttt 
atgtggggtt 
gaatttaagt; 
ggagtaaaaa 
aaaataaatg 
gtgtttgttt 
agaaatgtta 
agggagggta 
aatgagaaag 
tagtgatttt 
aaatagattt 
tttttttttg 
tgtatagtag 



gtttgtaatt 
gaggttgtag 
ttggttttaa 
gtattttagt 
tgttttggta 
agtagtttag 
tagtgagaaa 
ttttaggtta 
tttgtttgtg 
tattttgttg 
tttgggttta 
ttataataat 
ttaggttggt 
tgggattata 
ttttatttta 
agggttttga 
ttttgttgat 
agtgttgatt 
gtgtttagtt 
ttaaattatt • 
agtagaatgt 
aagaattaag 
ttattttttt 
ttttagtata 
gtgtgggttt 
agtatattaa 
gatagatttt 
tagggatggt 
tttgtttttt 
atttttagtt 
ggattagagg 
■ tttttaaagt 
gattatggaa 




tttttttttt- 
gttttttttt 
ttggtttatt 
tagttgggat 
atggggtttt 
tttggttttt 
tgtttttttt. 
tttttttttt 
tttagttttg 
ttttttagta 
ttagtataga 
gatttgtttg 
tttttttttt 
atgattttgg 
ttgggattat 
ggttttatta 
tggtttttta 
ttttgagata 
attgtaattt 
ggattatagg 
tttttttatt 
ggggtttata 
agt'attattt 
tatfctttttt 
agtttttgtt 
atggtatttg 
atagttttgg 
gttttttttt 
ttttttattt 



gtattatagg 
tgagatagag 
gtaatttttg 
tataggtgtt 
attattttgg 
taaagtgttg 
■ttttttttta 
ttttttttga 
atttattgta 
gttgggatta 
tggggttttg 
ttttggtttt 
tttttttgag 
tttattgtaa 
agttatatat 
tgttggttag 
aagtattggt 
gagttttatt 
ttgtttttta 
tgtgtatttt 
tttttatagt 
tttaatttaa 
attttgtaag 
atttaaataa. 
tttttttttt 
gtatatagtt 
gttagtaatt. 
tttttgtggg 
atttttaaat 



gatttat'ttt 
ttttgttttg 
ttttttgggt 
tgttattata 
ttaggttggt 
ggattaaagg 
atatataatt 
gatagagttt 
atttttgttt 
taggtatata 
ttatgttggt 
ttaaaatgtt 
atagagtttt 
tttttgtttt 
tattatgttt 
gttgattttg 
attagaggtg 
ttgttattta 
ggtttaagtg 
tgtggttagt 
ggaaaatggt 
tttttgtatt 
gaattttgta 
ggatttttgt 
ggtgtttagg 
ttttgataat 
tggatttttt 
taaagtgttt 
ttttttttag 



ttattttatt 
ttttttaggt 
ttaagtaatt 
attggttaat 
tttgaatttt 
tgtgagttat 
tgggattttt 
tgttttttta 
tttgggttta 
ttattatggt 
ttggtaggtt 
gagattatag 
attttgttat 
ttggttgaag 
ggttaatttt 
aatttttgat 
tgagttattg 
ggttggagtg 
atttttttgt 
tttttttttt 
ttaggattga 
taagttttgg 
aaattttatt 
tttatttttg 
tttggttttt 
ggggatattt 
tttttatttt 
ttgaatgttt 
atttatttta 



attgt'ttata 
tggagtgtag 
ttttgtttta 
tttttgtatt 
tgattttatg 
tgtatttggt 
tagtttttta 
tttaggttgg 
agtgatattt 
tagataattt 
ttgaattttt 
gtatgagtta 
ttaggttgga 
tgatttagtt 
tgtatgttta 
tgtaaatgat 
tatttggttt 
tagtggtatg 
tttatttttt 
aattggttag 
ttgatatgaa 
gttaagattt 
tgaagtatta 
ttaggtatat 
"tgagtttggt 
aggaggtttt 
tttaggtatt 
atggtttaaa 
gaggaaggga 



aaaattgatg 
tggtgtgatt 
gttttttaag 
tttagttgag 
atttatttat 
ttaaaattga 
tttttttttt 
aatgtagtgg 
ttgttttagt 
ttttgtattt 
ggttttaagt 
ttaagtttag 
gtgtagtggt 
ttttaagtag 
gtagagatag 
ttatttgttt 
ttttttttat 
attttggttt 
taagtagttg ' 
tgttttgtgg 
gataagttta 
tggtgtgttg 
tttataattt 
tgagttttat 
ggttatatta 
gagatatttt 
ttataattta 
ataagatttt 
atagaatttt 



2040 

2100 

2160 

2220 . 

2280 

2340 

2400 

2460 

2520 

2580 • 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 " 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
4080 
4140 ■ 
4200 
4260 
" 4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
- 5100 
- . 5160 
5220 
• 5280 
5340 
5400 
. 5460 . 
5520 
. 5580 
. 5640' 
5700 
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ttatatttta 
tatataggag 
tattttttaa 
gagggagtag 
tttagttata 
ttgtgtttaa 
tggaattggg 
ttatgtagtt 
ggtggagttt 
ttttggtgtg 
tttgagtttt 
gaaggtgttt 
agttgttaat 
tttgggttag 
tttgttttat 
attttattgt 
gttagtgttt 
tgggaagtgt 
ggagtatttg 
tttt'gtgttg 
tttgagaatg 
gtgtttatta 
taatagttta 
gagaggtttg 
ggaagtttta 
ggttatttta 
tggaaggtta 
tttttttttt 
ttatttaggt 
ttaagtgatt 
gtttagttaa 
tttaaatttt 
ggtttgagtt 
tatttgaagg 
ttafctttttt 
ttttttattt 
tttagtgatt 
tagtattatt 
tttatttttt 
atgatgaagt 
agatttatgg 
tattaggatg 
tagttttatt 
tagttattgg 
taaatatgtt 
tgttgtaatg 
attttgttgt 
ttgtgtttaa 
tattatgttt 
taggttggtt 
gggattatag 
attttgtgtt 
atgtatatat 
atgtttaatt 

* tagggtttat 

• ggattgtaaa 
gtttgttggt 
tt'agtatttg 

■ tttgtttgta 
taattttttt 
ttgggaagat 

. ttttaaagat 
atttgtggtt 



gtagttggtg 
tgtagatgga 
tttttggtga 
gagtatttga 
gttggataat 
tttgtttgtg 
gttttatgta 
aggagggtgg 
tagtgattgt 
gtttgtgagt 
tttggttgta 
ttgttttgtt 
tattgttttg 
agttgtgttg 
tggggtagga 
tttggagttt 
gggtaagtgt 
gtgtgtgtgg 
tggagttggt 
ttgggaaatt 
tttgtattgt 
tgtttgtgtt 
ggtagatgag 
gtggtgaggt 
gggagagggt 
tattgttatg 
taggtagaga 
ttttattggt 
tggagtg.taa 
tttttgtttt 
tttttgtatt 
tgattttagg 
attgtatttg 
aaaaattaaa 
tagattaaga 
attttagtta 
tgtgtattat 
tgaagtttgt 
gtagtttttg 
ttgggtttag 
ttgaaatatt 
tttaaaagat 
gttatatttg 
agataaaagt 
taaattattt 
ttttagtttt 
taaggttgga 
gtaatttttt' 
ggtttatttt 
ttgaattttt 
gtatgagtta 
ttttttagtt 
agatttggga 
tagattttat 
ttttagtagt 
agtatttttg 
gatttttagg 
atttattggg 
gggggtttat 
ttattttaaa 
ggattttgag 
aagaaattta 
tagatatgag 



ataggttaga 
ggagtaggat 
ggattttttt 
atgtggaaat 
gtttggttta 
atagaaatta 
tggttttgtg 
ggttgtttat 
tttttttttt 
tagtgaggtt 
gtagttattg 
tatagttgta 
gttgtgtgtg 
taattgttaa 
atgttggggt 
tatttataga 
ataagagtgg 
agtttgggtt 
aatttaggtt 
ggtaattaga 
ggtttgttta 
ttgttttatt 
tgtgtggtag 
ggaggagttt 
tgttgtttgt 
tggattttaa 
ggtttgttta 
aattaaaaaa 
tggtgtgatt 
agttttttga 
tttagtagag 
tgatttattt 
gttttttatt 
tatttttaat 
tattggtttt 
ggtttgtttt 
taaatggaat 
tttttttgat 
tttagttttt 
ttttgattat 
gatttaaatt 
gttttaagtg 
tttagttttt 
ttaaatttat 
tgttttttat 
gttttttttt 
gggtagtggt 
tattttagtt 
tttttatttt 
gattttaggt 
ttgtgtttgg 
gaatttgtga 
tattatttat 
gttataataa 
tgatttaata 
gtgattttag 
ggtgtagaag 
ttagaaaagg 
taaaggtttt 
ggttttggag 
gatagtaggt 
aaattagtgt 
tggagtgtga 



atagggaaga 
ttttttttgt 
ttttttatat 



tgaggtgtta 
ggtttattat 
agggggtttt 
ggtaggtttt 
tgatttagga 
gtttgttggt 
tgtgtggtga 
tttgttgtgt 
tttgtttgtt 
tgtgtgtatg 
gattgaaatg 
ggggatatgt 
ttttatttat 
tttttggttg 
agtttgggtt 
ttttttttag- 
atgttttttg 
tttttgtttt 
gtgggttttf 
tggtagtggt 
taggttgggg 
taggttttgt 
tgttggttat 
atagttggat 
ataataataa' 
ttggtttatt 
gtatttagga 
atggggtttt 
gttttggttt 
gggaatgtat 
ttatgtttgt 
tatatatttt 
tttttttagg 
ttagtgtttt 
ttttaggggt 
ttgtagattt 
tggtttggtt 
tttagattag 
aatatggtta 
tttttgtagt 
ttttgatttt 
ttttatggtt 
tttttttttt 



atgattttgg 
ttttgagta§ 
ttagtagata 
gatttatttg 
ttttttgttt 
tgtttttttg 
ttataagttt 
tttggggagt 
ggtttgtggt 
ttggtgttta. 
gattgttggg 
gtgtttgtta 
aaattttttt 
ggggttatga 
ataaattttt 
atttttatat 
taagggataa 



ggtgagggtt 
tttttaagtt 
ttttttttag 
gtttaaattg 
aagttatata 
ggtatttagt 
tggttaggat 
tgtggtaatg 
attttttggt 
agtattgttt 
tgttttaggt 
ttttagtttt 
tgtgttagtg 
tagattgttg 
atgttttgtt 
tatagggaat 
gaggtgaggg 
tgggtttgtt 
tttttgtgta 
tgtgtggtat 
ttttatatgt 
agtttaggtt 
aggtgaattt 
aaatgttttg 
aggtttgatt 
ttgggtgttt 
ttttattgtt 
aaaattgtgt 
gtaatttttg 
ttataggtgt 
attatgttag 
tttaaagtgt 
atggaatata 
tttgtggttg 
aattttttgt 
aaattgtttg 
attttttatt 
tatatttttt 
tgatttaatt 
tgtttatatt 
attttttgtg 
aaatttaatt 
aaaaatggtt 
ttttttgttt 
attttatttt 
tttttttgag 
tttattgtaa 
ttgggattat 
tgaggtttta 
tttttgtttt 
attttttgta 
tattggatga 
ttaaataggt 
ttttaaaatt 
gggatttagg 
tttaggggag 
tgtgtggttg 
aataaagatt 
aggttttttt 
gtgtttgaga 
ttttaagaag 
ataagtagtt 
attatttttg 



tagttggttt 
tttttaaata 



tttttttaag 
tttggttggt 
gttgtttttt 
atttaggtgg 
ttgtggggag 
gattggggag 
ttttatttgg 
gagttttgag 
tattttgaaa 
gtgtgtttgt 
tgtgtgtgtg 
ggatttagtt 
tttaggaatg 
gggggtgggt 
tgggaaggtg 
tgtagtgggt 
gaattagttt 
ttaggtagtt 
ttttggtttt 
ttggggtttg 

gtaatttgta 
gagattgaag 
tattttagtg 
ggaaattggt 
tagtatagaa 
tttgtttttg 
ttttttgggt 
ttgttattat 
ttaggttggt 
tgggattata 
tttgtttatt 
ttatttgttt 
ttttattttt 
ggatagggtt 
tttatttttt 
tagttttttt 
tttatatttt 
tatttgtttt 
ttagtatttt . 
tttttttttt 
attaggtttt 
ttatttttga 
tttttgagaa 
atagagtttt 
tttttgtttt 
aggtatttgt' 
ttatgttggt 
ttgaagtgtt 
ttttgttata 
gagggttttt 
tagagtagtg 
tattgatgtt 
ttagtgggga 
agtaattttt 
tgtgtatatt 
taataaaatt 
ttataggtgg 
agaggtaagt 
ggttaaggta 
atttttgttt 
tgtatttttt 



5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

'7320 

7380 

7440 

7500 

7560 

7620 

7680 

■ 7740 
7800 
7860 
7920 
7980 
8040 
8100 . 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
888 0 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480' 
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agtgatgggg 
ttgtgatttg 
gtgtggggtt 
ttgttataga 
tggttaggga 
ttggtatgtt 
agtttaatga 
gggtgttagt 
tttaggattt 
gtgtagggat 
tttgtttttt 
tggtgattgt 
tagaggaaga 
atatttggtt 
gagggagatt 
ttggtagagg 
gtatgggtgg 
taggatataa 
tgggtattga 
ttttaggata 
aagtttttgt 
ttagtttgta 
ggtgggggta 
ttataggttt 
ttggagttgg 
ttagattttt 



tgaaagtaat 
atttgtggtg 
attatttatg 
ggttgtatta 
ttagttgttg 
ttagttgggt 
gtttaggttt 
aagtgtgggt 
agattgggtg 
tgtggggttt 
tttttttttt 
aaagtattat 
gggtttttat 
tatttattgg 
ggttagtagt 
ttggattgtg 
ttggttttta 
gtttttttat 
ggatggtaat 
tttttatttg 
attgtagtaa 
ggttttagtt 
ttttatttat 
tatttttgtg 
tgtttggtaa 
gggatttagg 



ggatttagtt 

attgtgttgt 

tgtgtattgt 

tgtgtgtgtg 

tgtggggttg 

gtggtttttt 

ttttgatatg 

tgggtggggt 

gtgggaagga 

gaaggtgggg 

ttgttttttt 

gtggagatat 

atttggtttt 

gtggggtagg 

tttattgttt 

attttgaggt 

gtaattgtag 

gtttttttta 

gtatgtgggg 

tagtaatatt 

gtatagtggg 

tatttggggg 

atttatgttt 

tttgtagttg 

ttgtttgtta 



tttgggagtt 
tttttggttg 
aggtttttgt 
gtgggttttg 
tatgtggtgt 
ttagtgtgtt 
gtttggttgg 
tatagggtgg 
gttgaggaga 
ttgggtgtgt 
tttatatttt 
ttgtgttttt 
ggtttttttg 
aagttttgag 
gttttgtttt 
ttaggggtgt 
tttttatttg 
gtgtttgatt 
gatgtgggag 
atttattttg 
gttgttttgg 
aaagttagga 
tttgtatttt 
taggttttgt 
tttgtttttg 



gtttttgttg 

ttttttttgt 

gtatgatgtt 

tgggttggaa 

tttgtgtgat 

tagtgggtgt 

gtttgtgttt 

gttttgattt 

gttgtgtaat 

ttttgtagag 

attttggatg 

ggaggttagt 

gtttggtttg 

tttttatttg 

ttattgtgga 

attttgggtg 

gttttgttat 

tggtattttt 

tagggtttag 

gtattgtgag 

agttattgtt 

aggtttgatt 

ttttattttt 

tttgaggggt 

tttttttgtt 



attttttttg 

ttttgtaggt 

ttagatgaag 

gtggtggtta 

gtgtagtgtg 

tagtttttgt . 

tgttggtttt 

tagtgttttt 

ggaaatttgg 

ttttttttgt 

gttataatga 

tttattgtgt 

ttgaagtaat 

gggtgaggag 

gattggggtt 

gatttttttg 

tttgggttgt 

tgtaggtagg 

aggtttaagg 

tagtgtttag 

tttagtatat 

ggttttggaa 

tttgttattt 

tgaatatatg 

ttagttgttt 



<210> 25 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> chemically treated . genomic DNA (Homo sapiens) 



<4O0> 25 

ttttagattt 

attgttaagt 

gtgtgaaggt 

tgtgggtggg 

gggttgagat 

tttattgtag 

gtaggtgagg 

tattgttatt 

tatggaaaga* 

ttgagggttg 

ttatggttta 

agttgttgat 

tttagtgggt 

tgtggggatt 

tgtggtgttt 

aaggagggaa 

tgggttttat 

ttgtttagtt 

ggtttgtgtt 

ggagtttgag 

tatttggttg 

atggtagttg 

gtggtgtgat 

gtgtgagtag 

ttgttgtggg 

tgttattttt 



tagaaatttg 

attagtttta 

aaggtttgtg 

gtgtttttat 

ttataggttg 

tgtagaggtt 

gtattttaga 

tttaatattt 

tttgtgtttt 

attatttatg 

gtttttgttg 

tgattttttt 

aggttaagtg 

tttttttttt 

tgtggttgtt 

ggaagggtaa 

agtttttgta 

tggattttgg 

tgttagtgtt 

tttattgagt 

gagtgtgtta 

gtttttggtt 

ttttgtggtg 

tggttttgtg 

ttaagttgtg 

gttttattgt 



ggagtggttg 

gtatgtgttt 

gaaatggtag 

•tttttagggt 

gatgtgttag 

tttaagtgtt 

attttggatt 

atttgtttgt 

ggtagtttag 

ttaagggaat 

gagttttagt 

tttttttatt 

tgttgtttta 

agtatagtaa 

gttgttgtgg 

ggtggggggg 

tttaggtttt 

gggaggtgtt 

taaagttagt 

tgtgggagtt 

atgtgttgtg 

gtggttattg 

attttatttg 

tgtttatgag 

gtagaggggg 

tgaagagtgt 



gagtgagaaa 

agttttttag 

ggagggtgga 

tagttagatt 

aggtagtggt 

gtttatgatg 

tttaagtttt 

agggagtgtt 

ggtgatagag 

ttatttagaa 

ttttgtagtg 

ttaagtgaa'g 

gtaaattgga 

agttggtttt 

ttgtttgggg 

gttttgtgag 

tattgtgtgg 

gaagttgggg 

gaagtatggg 

ggtatttgtt 

tattgtgtgg 

tttttagttt 

gggtgttgtg 

agtggaaggg 

ttggtgggga 

gtgaaaatgg 



atagaggtaa 
agtaggattt 
ggggatgtag 
tttttgattt 
tttagagtgg 
ttagaatgag 
atttttatat 
aagttaagta 
ttaaatgagg 
tgtatttttg 
gagagtagag 
gtttgagatt 
ttaggagggt 
tagaaatatg 
tggggtgtga 
agtgtgttta 
ttttttttag 
tttgttttgt 
' tttaattggg 
gggtgtgttg 
ggtattgtgt 
gtggggtttg 
tgtaaaggtt 
gtagttaagg 
tagtttttga 
tttatttttt- 



gtggtaggta 

gtggttgtag 

gaggtatgga 

ttttttaggt 

ttttgttgtg 

tggtattgtt 

tttttatatg 

ttgggaaaag 

gttgtagttg 

aattttaaga 

t'gggtggtaa 

ttttgtttta 

tagggttgga 

ggtatttttg 

ggaggggatg 

gttttgtttt 

tttttttttg 

ggttttgttt ■ 

ttatgttggg 

ggaagggttg 

gtaattttat 

ttatgtatat 

tgtagtgtgt 

ggtagtgtag 

ggattaggtt 

gttgtatttt 



9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11001 



60 
120 
180 
..24 0- 
300 
360 
420 
480 
. 540 
600 
660 
720 
780 
840 
900 
960. 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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atttgtattt gggttataga tgagtagagg tggttgttta tatgtaaaaa tatgttgatt 1620 

ttaagttttt tatttttaaa atgttttggt tttttttgag aaagggtttg tgtttattgt 1680 

ttttggagtt tattttttta ggtttgtttt tttttaaata tttatgattt tttttagaat 1740 

ttttagggtg aagggaaatt attatttatg ggagggagtt tggaaaaatt tagaattttt 1800 

ggtgggtttt . ttgtaagtag gagttttgtt gagtttttat ttagtaaata tttttttttg 1860 

atttagtgaa ttagatgtta aaatatgtat gtagttatat atttagtagt ttttttgtat 1920 

ttttgggaat tgttagtaag taaaggttgt ttttttttgg gtagatatta gttggaatta 1980 

ttaggggtgt ttttatagtt ttttttgtta gtttggattt tattgtagat ttgttgaatt 2040 

aattgttggg agtggatttt aggtattagt aaattttaaa aattttttaa attattgtaa 2100 
tatggagttt gggttgagta ttattgtttt ggtttattta ggaatttgtg gatggatagt- 2160" 

gttttaggtt tgtgtgtgta tggagatttt tttatttggt ataagaggat attataaatt 2220 

tagttggggg gagtataaag ttgtgataga atgtaaagaa tgaataaggg gttgagtgtg 2280 
gtggtttatg tttgtaattt tagtattttg gaaggtggag gtgggtggat tatttgaggt ' 2340 

taggagttta agattagttt ggttaatatg gtgaaatttt atgtttatta aaaaataaaa 2400 

aaaaatgagt taggtgtagt ggtgggtgtt tgtaatttta gttatttggg aggttgaggt 24 60 

gggagaattg tttgaatata ggaggtggag gttgtagtga gttgagattg tgttattgtt 2520 

ttttagtttt ggtgata.gag tgagattttg ttttaaaaaa aaaaaaaaaa aaaaaaagaa 2580 

i ' taaggttggg atattgtagt gtttttaaag agaaataaag tagttatgga gataagaagt 2640 

aggatgattt gggtatgttt attagaggta gagataaggg agaaattaaa gataagtttg 27 00 

ggtttttgtt tttagtaatt gggagtttag tggttatttt tgttgtaaag aggaagttgg 27 60 

gtaagtgtag tagtgaggtt gaagaaaagg gaattaaatt ttggttatgt ttatttgaaa 2820 

^ tgttttttag atattttagt gaaggtattg gtatggagga tttagtttga gggtttaggt 2880 

■ tagtgtttta gttgtggatt tggggtagat gaatgtagat agattaggtt agtgattagg 2940 

attgagttta gattttattg tgagatatgg aagttgagtt agaatttgta aaggagttga 3000 

gtaggagttg' tagggggtag gaggaaaatt gggagagtgt agtttttggg agttaaaggg 3060 

agtaagtttt aaatgatgtt gagggggtga gaatggagaa tggaatattg gattttattt 3120 

ggtagtatat agattgttga ggattttgtt ttgggtagtt ttttggagga agaggtaagt 3180 

ttggttggag tgggtagagg ggagagtgaa ggtgaaggat tagagtgtat * agagattagt 3240 

gttttggttt gaggggagta gagataggtg ataattatag ggtagatgta ggttaaaggt 3300 

gtttagtttt ttttttaagt aaatgggtag atgtatttta tatatgtttt tagtgaaggg 3360 

• ttgggtgtgg tggtttaagt ttgtagtttt agtattttgg aaggttgagg tgggtggatt 3420 

atttgagatt aggagtttga gattagtttg gttaatatgg tgaaattttg tttttattaa 34 80 

aaatataaaa attagttggg tatggtggtg ggtgtttgta attttaggta tttaggaggt 3540 

tgaggtagaa gaattgtttg aatttaggag gtggaggttg tggtgagttg aaattgtgtt 3600 

attgtatttt agtttgggtg ataaaagtaa gatgtagttt tttgttgttg tttttttaat 3660 
tgttaatgag gaaaggggaa gttttgtgtt aggtgataga gatttaattg ttgagtaggt ■ 3720 

ttttttgttt gtggtttttt' ggttggtttt tagatgttta ggtggttaat attagagttt 3780 
gtgtagtagt gtgaggtaat ttattgagat aggttgggtt tgtggagttt ggtgagtagt . 3840 

ggtttttttt ttggggtttt tttttaattt ttgggatatt tttttgattt ggagtttttt 3900 
tgttttattg ttaggttttt ttgtagattg taagtttatt tgttattatt gttgttgtgt * ' 3960 
' '' _ ' gtttgtttg't ttggattgtt • gtgggttttg -ggatttg^ggt tgggaatttg tggtggagtg * 4020 

ggatatgaat gtggtgagtg tggggttgag ggtgtatggg aagggtgagg atgggtaggt 4080 

^ ^^^J tatagtgtag gtatttttga gggttgtttg ggtgttgtgt gtaaggagtg ttttaattgt 4140 

^^^F tgattttttg gtggtataga gaggttaatt ttgtgtgggg gttgggaggg gagtttggat 4200 

tgttggtttt gtaagtattt tatttgttgt aagtggattt gggtttaggt tgatttaggt 4260- 

tttgtgtatg tgtatttttt gtattttttt gtttttgttt ttggttagag gttatttttg 4320 

tgtgtttgtt tggatgttgg tatttgtttt tgttttttgt ggtaggtggg gtttgtgagt 4380 

ggagttttgg agtgatgagg ttatttttgg gggtgaagtg tgtgtgtttt tgttttggtg 4440 

tttttgtttt aatgagataa gagttagatt ttggtgattt atgttttagt tttaatggtt 4500 

gtggtgtggt tttggtttgg gtgtatgtgt atattgatat gtgtatatgt atgtatgtga 4560 
ttggggtggt ggttggtggt tatggatgtg taggattggg ggatgggtgg gtatggttat • • 4 620 

gggtgaggtg gaggtgtttt tttttgaaat gatttggagt agtatgatga gtagtggtta 4 680 

ttgtagttaa gaggatttgg atttggagtt tga.gtagtat tttattgtgt gaattttgtt 4740 

agtttgtagg ttgtgttggg attaggtggg agttaggggg tgttggtggg tgggagggga 4 800 

agtggttgtt ggagttttgt tttttttggt tt'gttgttgt gttttgggtt ggtgggtagt 4860 

tttatttttt tggttatgtg gttttttgtg ggttttggtt ggggatttgt ttgtggaatt 4 920 

gtgtgtaaga ttttgatttt attgtttaga tgttgggtgt tggggttttt ttggtttttg 4 980 

ttatagatag gttgaatatg gaaaaagtag ttgtatggtt tgtggtagat .ttgagttggg 5040 

tattatttag ttatgattaa agttgattga gtagtttgga ttagtatttt gatttttgtg 5100 

tttgaatgtt tttgtttttt ttttggggag attaggggag gatgtggaga gggaagagtt 5160 

tttgttagga attgagaagt atgtttagga aaatttgaga ggtagagaga gattttgttt 5220. 
ttttatttgt atttttgtat ggagttagtt gagtttttat tttttttttg ttttggtttg* ' 5280 
ttattagttg ttggaatgtg gaagattttg tttttttttt ttagggtgga tttggagaaa ..5340 
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